Feature Engineering for Improving Learning Environments (edX)

Every model used to predict a future outcome depends upon the quality of features used. This course focuses on developing better features to create better models. How can data-intensive research methods be used to create more equitable and effective learning environments? In this course, you will learn how data from digital learning environments and administrative data systems can be used to help better understand relevant learning environments, identify students in need of support, and assess changes made to learning environments.

This course pays particular attention to the ways in which researchers and data scientists can transform raw data into features (i.e., variables or predictors) used in various machine learning algorithms. We will provide strategies for using prior research, knowledge from practice, and logic to create features, as well as build and evaluate machine learning models. The process of building features will be discussed within a broader data-intensive research workflow using R.
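As a rough illustration of what turning raw data into features can look like, the sketch below aggregates a small event log into one row of features per student. The course itself works in R; Python is used here only to show the idea. The event log, its field layout, and the feature names (`n_events`, `total_seconds`, `video_share`) are all hypothetical and are not taken from the course materials.

```python
from collections import defaultdict

# Hypothetical raw event log: (student_id, event_type, seconds_spent).
# These records are invented for illustration only.
raw_events = [
    ("s1", "video", 300),
    ("s1", "quiz", 120),
    ("s2", "video", 60),
    ("s1", "video", 200),
    ("s2", "forum", 30),
]


def build_features(events):
    """Aggregate raw events into one feature row per student."""
    totals = defaultdict(lambda: {"n_events": 0, "total_seconds": 0, "n_video": 0})
    for student_id, event_type, seconds in events:
        row = totals[student_id]
        row["n_events"] += 1
        row["total_seconds"] += seconds
        if event_type == "video":
            row["n_video"] += 1

    features = {}
    for student_id, row in totals.items():
        features[student_id] = {
            "n_events": row["n_events"],
            "total_seconds": row["total_seconds"],
            # Share of a student's events that were video views.
            "video_share": row["n_video"] / row["n_events"],
        }
    return features


features = build_features(raw_events)
for student_id in sorted(features):
    print(student_id, features[student_id])
```

Each resulting row could then serve as input to a model such as logistic regression or a decision tree. Which aggregates are worth computing is exactly the kind of question that prior research and knowledge from practice are meant to inform.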

What you'll learn

  • How to transform and visualize data using R
  • How to apply selected machine learning algorithms (e.g., logistic regression and decision trees) to regression and classification tasks in R
  • Strategies for applying data-intensive research workflows for feature engineering and model building

Syllabus

Week 1: Finding features
Introduction to setting up a feature engineering workflow, which includes identifying problems of practice, reviewing relevant research, and brainstorming potential features.

Week 2: Data wrangling and visualization
Introduction to data wrangling, data visualization techniques, and structure discovery algorithms. Integrating theory, knowledge from practice, logic, and contextual factors into feature engineering will also be discussed.

Week 3: Modeling features
Introduction to using features within selected machine learning algorithms (e.g., logistic regression and decision trees) and the tradeoffs between interpretability and prediction.

Prerequisites
We highly recommend that you take the previous course in this series before beginning this course:
Predictive Modeling in Learning Analytics
This course is intended for those who have a bachelor’s degree and are interested in developing learning and data science skills for employment in education, corporate, nonprofit, and military sectors. Experience with programming and statistics will be beneficial to participants.


Related Courses

Aplicaciones de la Teoría de Grafos a la vida real II (edX)
Universitat Politècnica de València, UPValenciaX

We will learn to model real-world problems by representing them as graphs and to solve them using the associated algorithms. This course approaches graph theory from a modeling perspective, which will later let us solve many problems of different kinds. We will present examples of the various problems in a real context, analyze how they are represented as graphs, and look at the algorithms needed to solve them.

Self-Paced
Statistical Inference and Modeling for High-throughput Experiments (edX)
HarvardX, Harvard University

This course focuses on the techniques commonly used to perform statistical inference on high-throughput data. You'll learn statistics topics including the multiple testing problem, error rates, error rate controlling procedures, false discovery rates, q-values, and exploratory data analysis. We then introduce statistical modeling and how it is applied to high-throughput data. In particular, we will discuss parametric distributions, including binomial, exponential, and gamma, and describe maximum likelihood estimation.

Self-Paced
Análisis de datos: Llévalo al MAX() (edX)
Delft University of Technology, DelftX

Build your data analysis skills using spreadsheets and data visualization in Excel. Boost your productivity and make better business decisions. This data analysis (business intelligence: BI) and statistics course is for anyone who wants to improve their data analysis skills. Are you looking for a smart way to visualize data so that it makes sense? Do you want to understand that crazy collection of data your boss gave you? Do you have megabytes of sensor data to analyze? Don't worry, we've got you covered!

Self-Paced
Cluster Analysis (edX)
University of Texas at Arlington, UTArlingtonX

Learn how to conduct a cluster analysis to discover important patterns in student behavior using the popular Weka data mining toolkit. In this course, you will learn the basics of cluster analysis, one of the most popular data mining methods for the discovery of patterns in learning data, and its application in learning analytics.

No sessions available
3 Weeks
Data, Models and Decisions in Business Analytics (edX)
Columbia University, ColumbiaX

Learn fundamental tools and techniques for using data towards making business decisions in the face of uncertainty. In today’s world, managerial decisions are increasingly based on data-driven models and analysis using statistical and optimization methods that have dramatically changed the way businesses operate in most domains including service operations, marketing, transportation, and finance.

This course is archived
5-12 Weeks
Learning Analytics Fundamentals (edX)
University of Texas at Arlington, UTArlingtonX

Learn about the growing field of learning analytics and how to analyze basic data sets to generate insights. The demand for data science and learning science skills has continued to increase as classrooms, labs, and organizations look to optimize their data and improve learning environments for students and employees. The UTArlingtonX Learning Analytics courses will give you the opportunity to gain invaluable knowledge and expertise in this growing field.

No sessions available
4 Weeks
Datos para la efectividad de las políticas públicas (edX)
Inter-American Development Bank - IDB, IDBx

This course will help you take control of data and become familiar with the tools for using it in the planning, management, and evaluation of public policies. In this information age, data is available everywhere and is growing at an exponential rate. How can we make sense of all this data and put it to use when making decisions? How do we use it to help guide the management and planning of our policies? Whether you are a citizen or a policy planner, you should be able to answer these questions.

Self-Paced
Data Science Essentials (edX)
Microsoft

Explore data visualization and exploration concepts with experts from MIT and Microsoft, and get an introduction to machine learning. Demand for data science talent is exploding. Develop your career as a data scientist, as you explore essential skills and principles with experts from MIT and Microsoft. In this data science course, you will learn key concepts in data acquisition, preparation, exploration, and visualization. Plus, look at examples of how to build a cloud data science solution using Azure Machine Learning, R, and Python.

Course Not Available
Data Science: Wrangling (edX)
HarvardX, Harvard University

Learn to process and convert raw data into formats needed for analysis. In this course, we cover several standard steps of the data wrangling process like importing data into R, tidying data, string processing, HTML parsing, working with dates and times, and text mining. Rarely are all these wrangling steps necessary in a single analysis, but a data scientist will likely face them all at some point.

Self-Paced