STARTS

Nov 28th 2016

Data Cleansing with Data Quality Services (DQS) (edX)

Created by:Delivered by:

A straightforward, no-nonsense approach to improving your data cleansing skills with SQL Server Data Quality Services (DQS). Is your data messy? Do you need to learn how to clean it up? In this computer science course, we will discuss the discipline of Data Quality Assurance and Data Quality Services (DQS).

You will learn why your data needs cleansing, the capabilities and features of DQS, what a DQS solution looks likes and how data cleansing integrates with an Integration Services (SSIS) data flow. We will demonstrate a variety of critical data quality activities such as knowledge discovery, domain management, matching policies for the de-duplication of data, reference data services, and administration topics covering installation, configuration and security.


What you'll learn:

- The discipline of Data Quality Assurance

- SQL Server DQS capabilities and features

- Creating DQS solutions

- The SSIS DQS Cleansing component

- Matching Policies and Projects

- Referencing data services in DQS

- Administering DQS


Prerequisites:

- A basic understanding of the Business Intelligence process.

- A basic understanding of database design and storage.


Suggested prerequisites / Supporting Material:
Delivering a Relational Data Warehouse
Implementing ETL with SQL Server Integration Services (SSIS)




Course Syllabus


Module 1

- Introducing DQS

Discipline of Data Quality Assurance

SQL Server DQS, overview of capabilities and features; concepts; support by SQL Server version and edition

Installation and setup

- Creating DQS Solutions

Knowledge bases; Domains; Composite domains; DQ Projects

Exploring an existing solution


Module 2

- Cleansing Data with DQS

Rules; Values

Creating a knowledge base to cleanse data in a DQ Project

- Cleansing Data with DQS in SSIS

SSIS DQS Cleansing component

Integrating cleansing in SSIS data flow

Creating knowledge base, cleanse data in a DQ Project and SSIS data flow


Module 3

- Matching Data with DQS

Matching policies; Matching projects

Creating a matching policy, and de-duplicating data in a DQ project

De-duplicating data in a DQ Project

- Referencing Data Services in DQS

Cleansing data using reference data

Validate and geocode addresses

- Administering DQS

Installation; Configuration; Security

Install, configure and secure a DQS instance


Module 4

Final Assessment