Coursera

Generative AI Language Modeling with Transformers (Coursera)

Offered by IBM,

This course provides you with an overview of how to use transformer-based models for natural language processing (NLP). In this course, you will learn to apply transformer-based models for text classification, focusing on the encoder component. You’ll learn about positional encoding, word embedding, and attention mechanisms in language transformers and their role in capturing contextual information and dependencies.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

Additionally, you will be introduced to multi-head attention and gain insights on decoder-based language modeling with generative pre-trained transformers (GPT) for language translation, training the models, and implementing them in PyTorch.
Further, you’ll explore encoder-based models with bidirectional encoder representations from transformers (BERT) and train using masked language modeling (MLM) and next sentence prediction (NSP).
Finally, you will apply transformers for translation by gaining insight into the transformer architecture and performing its PyTorch implementation.
The course offers practical exposure with hands-on activities that enables you to apply your knowledge in real-world scenarios.
This course is part of a specialized program tailored for individuals interested in Generative AI engineering.
This course requires a working knowledge of Python, PyTorch, and machine learning.

What you'll learn

Explain the concept of attention mechanisms in transformers, including their role in capturing contextual information.
Describe language modeling with the decoder-based GPT and encoder-based BERT.
Implement positional encoding, masking, attention mechanism, document classification, and create LLMs like GPT and BERT.
Use transformer-based models and PyTorch functions for text classification, language translation, and modeling.

Syllabus

Fundamental Concepts of Transformer Architecture
In this module, you will learn the techniques to achieve positional encoding and how to implement positional encoding in PyTorch. You will learn how attention mechanism works and how to apply attention mechanism to word embeddings and sequences. You will also learn how self-attention mechanisms help in simple language modeling to predict the token. In addition, you will learn about scaled dot-product attention mechanism with multiple heads and how the transformer architecture enhances the efficiency of attention mechanisms. You will also learn how to implement a series of encoder layer instances in PyTorch. Finally, you will learn how to use transformer-based models for text classification, including creating the text pipeline and the model and training the model.

Advanced Concepts of Transformer Architecture
In this module, you will learn about decoders and GPT-like models for language translation, train the models, and implement them using PyTorch. You will also gain knowledge about encoder models with Bidirectional Encoder Representations from Transformers (BERT) and pretrain them using masked language modeling (MLM) and next sentence prediction (NSP). You will also perform data preparation for BERT using PyTorch. Finally, you learn about the applications of transformers for translation by understanding the transformer architecture and performing its PyTorch Implementation. The hands-on labs in this module will give you good practice in how you can use the decoder model, encoder model, and transformers for real-world applications.

Go to Class

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Coursera

DeepLearning.AI

AI For Everyone (Coursera)

Business

AI is not only for engineers. If you want your organization to become better at using AI, this is the course to tell everyone--especially your non-technical colleagues--to take.

Aug 10th 2026

4 Weeks

Artificial Intelligence Machine Learning Neural Networks

Coursera

University of Colorado Boulder

Ethical Issues in Data Science (Coursera)

Statistics & Data Analysis Data Science

Computing applications involving large amounts of data – the domain of data science – impact the lives of most people in the U.S. and the world. These impacts include recommendations made to us by internet-based systems, information that is available about us online, techniques that are used for security and surveillance, data that is used in health care, and many more. In many cases, they are affected by techniques in artificial intelligence and machine learning.

Aug 3rd 2026

5-12 Weeks

Philosophy Artificial Intelligence Security

Coursera

Google Cloud

Introduction to Generative AI Studio (Coursera)

CS: Software Engineering

This course introduces Generative AI Studio, a product on Vertex AI, that helps you prototype and customize generative AI models so you can use their capabilities in your applications. In this course, you learn what Generative AI Studio is, its features and options, and how to use it by walking through demos of the product. In the end, you will have a quiz to test your knowledge.

Aug 3rd 2026

1 Week

Artificial Intelligence AI Coursera Plus

Coursera

Coursera Instructor Network

User Awareness and Education for Generative AI (Coursera)

Data Science

This course aims to empower general users with a friendly and non-technical understanding of Generative AI. It emphasizes the importance of transparency in AI systems, helping learners to comprehend how AI decisions are made.

Aug 3rd 2026

1 Week

Artificial Intelligence Decision Making AI

Coursera

University of Leeds

How to Get Into AI (Coursera)

CS: Design & Product

AI forms the basis for all computer learning and is the future of all complex decision-making. As such it is becoming increasingly prevalent in daily life. As a result, the outlook is bright for artificial intelligence jobs. In this course, you will learn how to navigate the dynamic field of artificial intelligence (AI), exploring its applications and the evolving landscape of AI-related careers.

Aug 3rd 2026

2 Weeks

Artificial Intelligence Workplace Career Development

Coursera

Fractal Analytics

Responsible AI - Principles and Ethical Considerations (Coursera)

Statistics & Data Analysis Data Science

Welcome to "Responsible AI – Principles and Ethical Considerations"! Dive deep into the very essence of Responsible AI with us. Uncover the significance of key principles shaping technology's future. From ethical considerations to fairness, transparency, and accountability, we discuss these principles with real-world examples, putting them into the context of data science.

Aug 10th 2026

5-12 Weeks

Artificial Intelligence Security Privacy

Coursera

Coursera Instructor Network

Leveraging AI for Enhanced Content Creation (Coursera)

Data Science

This course provides a foundation to assess, and apply, a series of Generative Artificial Intelligence (AI) tools, such as ChatGPT, Bing Chat, Google Bard, Midjourney, Runway, and Eleven Labs. This learning opportunity offers a hands-on experience through ideating, creating, and finalizing a mock advertising campaign using the combined strengths of these AI tools.

Aug 10th 2026

1 Week

Artificial Intelligence AI Runway

Coursera

Institut Mines-Telecom

Data intelligence for businesses and managers (Coursera)

Statistics & Data Analysis Data Science

With the proliferation of connected objects (computers, tablets, watches, etc.), huge masses of data are generated every second. This Big Data has led to the emergence of a data economy, where data is the main source of competitive advantage for companies. In this sense, data and its processing tools have become a strategic priority for companies, and the main gas pedal of their digital transformation.

Aug 10th 2026

5-12 Weeks

Artificial Intelligence Data Management Business Strategy

Coursera

AWS

Amazon Bedrock - Getting Started (Coursera)

Data Science

Amazon Bedrock is a fully managed service that makes foundation models (FMs) from Amazon and leading artificial intelligence (AI) startups available through an API. In this course, you will learn the benefits of Amazon Bedrock. You will learn how to start using the service through a demonstration in the Amazon Bedrock console. You will also learn about the AI concepts of Amazon Bedrock and how you can use the service to accelerate development of generative AI applications.

Aug 3rd 2026

1 Week

Artificial Intelligence AI AWS

Coursera

University of Virginia

Artificial Intelligence in Marketing (Coursera)

Marketing & Communication Business

AI is everywhere! By harnessing the power of Artificial Intelligence, businesses and marketers have amazing growth potential, and the opportunities to enhance marketing with AI are always expanding. But how can businesses use AI tools to drive their success and gain sustainable competitive advantages? What are the challenges faced by businesses as they implement AI into their marketing strategies? In this course, developed at the Darden School of Business at the University of Virginia, and delivered by Professor of Business Administration Raj Venkatesan, you will explore an important frontier of digital transformation in marketing.

Aug 3rd 2026

4 Weeks

Business Marketing Artificial Intelligence

Coursera

Politecnico di Milano

Artificial Intelligence and legal issues (Coursera)

Robotics & Computer Vision

The purpose of the course is to help students understand the legal implications related to the design and use of artificial intelligence systems, providing an overview of the risks and legal protections that can be envisaged and giving an overview of the legislation and legal principles currently applicable on the subject.

Aug 10th 2026

4 Weeks

Artificial Intelligence Law Intellectual Property

Coursera

Korea Advanced Institute of Science and Technology - KAIST

AI Materials (Coursera)

CS: Design & Product

Learn about the materials that have advanced the performance of artificial intelligence, and the machine learning models that could help accelerate the design and development of novel materials. This course defines artificial intelligence (AI) as a machine to which some or all of the functions of the human brain have been delegated. It highlights the need, and explains in an easy-to-understand way how machine learning from artificial intelligence can dramatically accelerate the development of new materials.

Aug 10th 2026

5-12 Weeks

Artificial Intelligence Machine Learning AI