Python and Machine-Learning for Asset Management with Alternative Data Sets (Coursera)

Offered by EDHEC Business School,
Python and Machine-Learning for Asset Management with Alternative Data Sets (Coursera)

Over-utilization of market and accounting data over the last few decades has led to portfolio crowding, mediocre performance and systemic risks, incentivizing financial institutions which are looking for an edge to quickly adopt alternative data as a substitute to traditional data. This course introduces the core concepts around alternative data, the most recent research in this area, as well as practical portfolio examples and actual applications.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

The approach of this course is somewhat unique because while the theory covered is still a main component, practical lab sessions and examples of working with alternative datasets are also key. This course is fo you if you are aiming at carreers prospects as a data scientist in financial markets, are looking to enhance your analytics skillsets to the financial markets, or if you are interested in cutting-edge technology and research as they apply to big data. The required background is: Python programming, Investment theory , and Statistics. This course will enable you to learn new data and research techniques applied to the financial markets while strengthening data science and python skills.
Course 4 of 4 in the Investment Management with Python and Machine Learning Specialization.

What You Will Learn

  • Learn what alternative data is and how it is used in financial market applications.
  • Become immersed in current academic and practitioner state-of-the-art research pertaining to alternative data applications.
  • Perform data analysis of real-world alternative datasets using Python.
  • Gain an understanding and hands-on experience in data analytics, visualization and quantitative modeling applied to alternative data in finance

Syllabus

WEEK 1
Consumption
The consumption module introduces students to the basics of consumption-based alternative data.
By aggregating online and offline consumer purchase activity and behavioral datasets including geolocation data (e.g., cell locations, satellite imagery etc.), transaction data (e.g., credit card transaction logs and point of sale data), as well as consumer interaction with brands and products on social media, researchers can learn about company performance ahead of official company earning announcements. Such information may be extremely useful and can provide investment and risk management advantages. This module reviews the theoretical aspects of various consumption datasets, and provides practical demonstrations of relevant data analytics.

WEEK 2
Textual Analysis for Financial Applications
Module 2 is an introduction to text mining as well as a demonstration of how to get from data retrieval (web scraping) to financial market insights. Some of the classic text mining methodologies are covered such as vectorization of text (the bag of words approach), stop words for filtering, and term frequency-inverse document frequency (TF-IDF). Students will learn how text can be mathematically represented, and regularized/filtered to reduce noise. Measures of text-similarity will be covered in theoretical and practice sessions. Lab sessions go through examples of web scraping data, regularizing with the described techniques and finally, insights will be derived from the textual data.

WEEK 3
Processing Corporate Filings
Module 3 is a practical extension of the text mining lessons to 10-K and 13-F, two of the most commonly researched corporate filings. This type of data can be extremely daunting when used by individual analysts due to the sheer size of the documents, but module 3 describes the methodologies for quantitatively analyzing these documents with Python code. Both the 10-K and 13-F documents are worked through, and within the lab sessions it is demonstrated how one can automatically pull this kind of data as well as define metrics around them. We investigate implementations of research in this field around similarity of given companies 10-K statements over time as well as similarity between fund holdings from the 13-F in the lab.

WEEK 4
Using Media-Derived Data
The final module introduces both sentiment analysis in the context of textual data as well as network analysis in the context of connectivity of firms. Sentiment analysis is an avenue of potentially fruitful information that when done correctly can display what a general population might believe about a company (through for example social media) or even whether the company itself is positive or negative on future outlook (through analysis of tone in corporate filings). Network analysis, as shown in the research of course instructors and his colleagues, can be used to accurately capture how a financial network is oriented and what companies might perform well because of other firm’s mentioning them as a threat. The lab session of this module extends the corporate filings analysis to examine sentiment while also introducing a set of tweets which are then transformed into a network representation.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Machine Learning: Regression (Coursera) Coursera
University of Washington

Machine Learning: Regression (Coursera)

Case Study - Predicting Housing Prices. In our first case study, predicting house prices, you will create models that predict a continuous value (price) from input features (square footage, number of bedrooms and bathrooms,...). This is just one of the many places where regression can be applied. Other applications range from predicting health outcomes in medicine, stock prices in finance, and power usage in high-performance computing, to analyzing which regulators are important for gene expression.

Jul 6th 2026
5-12 Weeks
Recommender Systems: Evaluation and Metrics (Coursera) Coursera
University of Minnesota

Recommender Systems: Evaluation and Metrics (Coursera)

In this course you will learn how to evaluate recommender systems. You will gain familiarity with several families of metrics, including ones to measure prediction accuracy, rank accuracy, decision-support, and other factors such as diversity, product coverage, and serendipity. You will learn how different metrics relate to different user goals and business goals.

Jul 6th 2026
4 Weeks
Introduction to Machine Learning (Coursera) Coursera
Duke University

Introduction to Machine Learning (Coursera)

This course will provide you a foundational understanding of machine learning models (logistic regression, multilayer perceptrons, convolutional neural networks, natural language processing, etc.) as well as demonstrate how these models can solve complex problems in a variety of industries, from medical diagnostics to image recognition to text prediction.

Jul 10th 2026
5-12 Weeks
Programming for Everybody (Getting Started with Python) (Coursera) Coursera
University of Michigan

Programming for Everybody (Getting Started with Python) (Coursera)

This course aims to teach everyone the basics of programming computers using Python. We cover the basics of how one constructs a program from a series of simple instructions in Python. The course has no pre-requisites and avoids all but the simplest mathematics. Anyone with moderate computer experience should be able to master the materials in this course.

Jul 6th 2026
5-12 Weeks
Python Data Structures (Coursera) Coursera
University of Michigan

Python Data Structures (Coursera)

This course will introduce the core data structures of the Python programming language. We will move past the basics of procedural programming and explore how we can use the Python built-in data structures such as lists, dictionaries, and tuples to perform increasingly complex data analysis. This course will cover Chapters 6-10 of the textbook “Python for Everybody”. This course covers Python 3.

Jul 6th 2026
5-12 Weeks
Using Machine Learning in Trading and Finance (Coursera) Coursera
New York Institute of Finance,Google Cloud

Using Machine Learning in Trading and Finance (Coursera)

This course provides the foundation for developing advanced trading strategies using machine learning techniques. In this course, you’ll review the key components that are common to every trading strategy, no matter how complex. You’ll be introduced to multiple trading strategies including quantitative trading, pairs trading, and momentum trading.

Jul 10th 2026
4 Weeks
Learn to code with AI (Coursera) Coursera
Scrimba

Learn to code with AI (Coursera)

Imagine waking up tomorrow as a web developer. What would you want to build? With AI tools like ChatGPT, you're already a developer, regardless of your experience, if you know how to work with them. So in this course, you'll build functional, interactive front-end projects while learning how to write effective prompts and debug and refine your code with the help of AI.

Jul 8th 2026
2 Weeks
Practical Machine Learning (Coursera) Coursera
Johns Hopkins University

Practical Machine Learning (Coursera)

One of the most common tasks performed by data scientists and data analysts are prediction and machine learning. This course will cover the basic components of building and applying prediction functions with an emphasis on practical applications. The course will provide basic grounding in concepts such as training and tests sets, overfitting, and error rates.

Jul 6th 2026
4 Weeks
Neural Networks and Deep Learning (Coursera) Coursera
DeepLearning.AI

Neural Networks and Deep Learning (Coursera)

If you want to break into cutting-edge AI, this course will help you do so. Deep learning engineers are highly sought after, and mastering deep learning will give you numerous new career opportunities. Deep learning is also a new "superpower" that will let you build AI systems that just weren't possible a few years ago. In this course, you will learn the foundations of deep learning.

Jul 6th 2026
4 Weeks
Code Free Data Science (Coursera) Coursera
University of California, San Diego

Code Free Data Science (Coursera)

The Code Free Data Science class is designed for learners seeking to gain or expand their knowledge in the area of Data Science. Participants will receive the basic training in effective predictive analytic approaches accompanying the growing discipline of Data Science without any programming requirements. Machine Learning methods will be presented by utilizing the KNIME Analytics Platform to discover patterns and relationships in data.

Jul 6th 2026
4 Weeks