AI Workflow: Business Priorities and Data Ingestion (Coursera)

Offered by IBM.

This is the first course of a six-part specialization. You are strongly encouraged to complete the courses in order: they are not independent courses but parts of a workflow in which each course builds on the previous ones. This first course in the IBM AI Enterprise Workflow Certification specialization introduces you to the scope of the specialization and its prerequisites.

Specifically, the courses in this specialization are meant for practicing data scientists who are knowledgeable about probability, statistics, linear algebra, and Python tooling for data science and machine learning. A hypothetical streaming media company will be introduced as your new client. You will be introduced to the concept of design thinking, IBM's framework for organizing large enterprise AI projects. You will also be introduced to the basics of scientific thinking, because the quality that distinguishes a seasoned data scientist from a beginner is creative, scientific thinking. Finally, you will start your work for the hypothetical media company by understanding the data they have and by building a data ingestion pipeline using Python and Jupyter notebooks.
By the end of this course you should be able to:

  1. Know the advantages of carrying out data science using a structured process
  2. Describe how the stages of design thinking correspond to the AI enterprise workflow
  3. Discuss several strategies used to prioritize business opportunities
  4. Explain where data science and data engineering have the most overlap in the AI workflow
  5. Explain the purpose of testing in data ingestion
  6. Describe the use case for sparse matrices as a target destination for data ingestion
  7. Know the initial steps that can be taken towards automation of data ingestion pipelines
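
As an illustration of objective 6, here is a minimal sketch (not taken from the course materials) of why a sparse structure can suit an ingestion target such as user-by-title interaction data: most cells are empty, so only the cells that actually occur are stored. The record layout, the function name `to_sparse`, and the sample events are all hypothetical; in practice a library such as SciPy's `scipy.sparse` would likely be used instead of a plain dictionary.

```python
from collections import defaultdict

def to_sparse(events):
    """Build a dictionary-of-keys sparse matrix from (user, item, count) records.

    Returns (matrix, user_index, item_index), where matrix maps
    (row, col) -> accumulated count. Only cells that appear in the
    events are stored; every other cell is implicitly zero.
    """
    user_index, item_index = {}, {}
    matrix = defaultdict(int)
    for user, item, count in events:
        # Assign the next free row/column the first time an id is seen.
        row = user_index.setdefault(user, len(user_index))
        col = item_index.setdefault(item, len(item_index))
        matrix[(row, col)] += count
    return dict(matrix), user_index, item_index

# Hypothetical streaming events: (user_id, title_id, number_of_streams)
events = [("u1", "t1", 2), ("u2", "t3", 1), ("u1", "t1", 1), ("u3", "t2", 5)]
matrix, users, items = to_sparse(events)

# Fraction of cells actually stored, relative to a dense users x items table.
density = len(matrix) / (len(users) * len(items))
```

Here three of the nine possible cells are stored. With realistic numbers of users and titles the stored fraction is typically far smaller, which is the motivation for a sparse target.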

Course 1 of 6 in the IBM AI Enterprise Workflow Specialization.

Syllabus

WEEK 1
IBM AI Enterprise Workflow Introduction
The goal of this first module is to introduce you to the overall specialization requirements, evaluate your understanding of some key prerequisite knowledge, and familiarize you with several process models commonly used today. In this course we will use the process of design thinking, but it is the consistent application of a process in practice that is important, not the exact process itself. There are a number of reasons for choosing the design thinking process, but the most important is that it is being applied in a cross-disciplinary way, that is, outside of data science.
Data Collection
Throughout this module you will learn, or reinforce what you already know, about identifying and articulating business opportunities. You will also learn the importance of applying a scientific thought process to the task of understanding the business use case. This process has many similarities to the work of an investigator. You will develop a healthy respect for the need to pause, step back, and think scientifically about the main processes in this stage.

WEEK 2
Data Ingestion
Cleaning, parsing, assembling, and gut-checking data are among the most time-consuming tasks that a data scientist has to perform. The time spent on data cleaning can start at 60% and increase depending on data quality and the project requirements. This module looks at the process of ingesting data and presents a case study that works through a real-world scenario.
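
One way to picture the parsing and gut-checking steps is a small ingestion function that converts raw records to typed values and sets aside rows that fail basic checks. This is a sketch under stated assumptions, not the course's case study: the column names (`customer_id`, `stream_date`, `minutes`), the function name `ingest_rows`, and the sample data are hypothetical.

```python
import csv
import io
from datetime import date

def ingest_rows(text):
    """Parse CSV text into typed records, separating out rows that fail checks."""
    clean, rejected = [], []
    for row in csv.DictReader(io.StringIO(text)):
        try:
            record = {
                "customer_id": int(row["customer_id"]),
                "stream_date": date.fromisoformat(row["stream_date"]),
                "minutes": float(row["minutes"]),
            }
            # Gut check: a stream cannot have a negative duration.
            if record["minutes"] < 0:
                raise ValueError("negative duration")
        except (ValueError, KeyError, TypeError):
            # Keep the raw row so that it can be inspected later.
            rejected.append(row)
            continue
        clean.append(record)
    return clean, rejected

# Hypothetical raw input with one bad date and one negative duration.
raw = """customer_id,stream_date,minutes
1,2019-01-05,42.5
2,not-a-date,10
3,2019-01-07,-3
4,2019-01-08,17
"""
clean, rejected = ingest_rows(raw)
```

Returning the rejected rows rather than silently dropping them makes the checks themselves testable: a test can assert how many rows pass and which ones are set aside.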

Related Courses

Exploratory Data Analysis (Coursera) Coursera
Johns Hopkins University

This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data.

Jun 22nd 2026
4 Weeks

Machine Learning: Classification (Coursera) Coursera
University of Washington

Case Studies: Analyzing Sentiment & Loan Default Prediction. In our case study on analyzing sentiment, you will create models that predict a class (positive/negative sentiment) from input features (text of the reviews, user profile information,...). In our second case study for this course, loan default prediction, you will tackle financial data, and predict when a loan is likely to be risky or safe for the bank.

Jun 22nd 2026
5-12 Weeks

Generative AI Essentials: Overview and Impact (Coursera) Coursera
University of Michigan

With the rise of generative artificial intelligence, there has been a growing demand to explore how to use these powerful tools not only in our work but also in our day-to-day lives. Generative AI Essentials: Overview and Impact introduces learners to large language models and generative AI tools, like ChatGPT. In this course, you’ll explore generative AI essentials, how to ethically use artificial intelligence, its implications for authorship, and what regulations for generative AI could look like.

Jun 26th 2026
1 Week

Navigating Generative AI: A CEO Playbook (Coursera) Coursera
Coursera Instructor Network

Created by Coursera's CEO, this course is your key to unlocking the transformative power of GenAI. It features hands-on labs with access to Google Gemini Pro in a secure, private environment. These labs not only teach you how to use GenAI, but also how to apply it to design your GenAI strategy, identify specific opportunities to enhance customer value, increase productivity, and navigate risks and ethical considerations in order to move quickly but safely in adopting generative AI.

Jun 25th 2026
5-12 Weeks

Preparing for the Google Cloud Professional Data Engineer Exam (Coursera) Coursera
Google Cloud

From the course: "The best way to prepare for the exam is to be competent in the skills required of the job." This course uses a top-down approach to recognize knowledge and skills already known, and to surface information and skill areas for additional preparation. You can use this course to help create your own custom preparation plan. It helps you distinguish what you know from what you don't know. And it helps you develop and practice skills required of practitioners who perform this job.

Jun 27th 2026
5-12 Weeks

Machine Learning Foundations: A Case Study Approach (Coursera) Coursera
University of Washington

Do you have data and wonder what it can tell you? Do you need a deeper understanding of the core ways in which machine learning can improve your business? Do you want to be able to converse with specialists about anything from regression and classification to deep learning and recommender systems? In this course, you will get hands-on experience with machine learning from a series of practical case-studies.

Jun 22nd 2026
5-12 Weeks

Introduction to Machine Learning (Coursera) Coursera
Duke University

This course will provide you a foundational understanding of machine learning models (logistic regression, multilayer perceptrons, convolutional neural networks, natural language processing, etc.) as well as demonstrate how these models can solve complex problems in a variety of industries, from medical diagnostics to image recognition to text prediction.

Jun 26th 2026
5-12 Weeks

The Data Scientist's Toolbox (Coursera) Coursera
Johns Hopkins University

In this course you will get an introduction to the main tools and ideas in the data scientist's toolbox. The course gives an overview of the data, questions, and tools that data analysts and data scientists work with. There are two components to this course. The first is a conceptual introduction to the ideas behind turning data into actionable knowledge. The second is a practical introduction to the tools that will be used in the program like version control, markdown, git, GitHub, R, and RStudio.

Jun 22nd 2026
4 Weeks