You will learn why your data needs cleansing, the capabilities and features of DQS, what a DQS solution looks likes and how data cleansing integrates with an Integration Services (SSIS) data flow. We will demonstrate a variety of critical data quality activities such as knowledge discovery, domain management, matching policies for the de-duplication of data, reference data services, and administration topics covering installation, configuration and security.
What you'll learn:
- The discipline of Data Quality Assurance
- SQL Server DQS capabilities and features
- Creating DQS solutions
- The SSIS DQS Cleansing component
- Matching Policies and Projects
- Referencing data services in DQS
- Administering DQS
Prerequisites:
- A basic understanding of the Business Intelligence process.
- A basic understanding of database design and storage.
Suggested prerequisites / Supporting Material:
Delivering a Relational Data Warehouse
Implementing ETL with SQL Server Integration Services (SSIS)
Course Syllabus
Module 1
- Introducing DQS
Discipline of Data Quality Assurance
SQL Server DQS, overview of capabilities and features; concepts; support by SQL Server version and edition
Installation and setup
- Creating DQS Solutions
Knowledge bases; Domains; Composite domains; DQ Projects
Exploring an existing solution
Module 2
- Cleansing Data with DQS
Rules; Values
Creating a knowledge base to cleanse data in a DQ Project
- Cleansing Data with DQS in SSIS
SSIS DQS Cleansing component
Integrating cleansing in SSIS data flow
Creating knowledge base, cleanse data in a DQ Project and SSIS data flow
Module 3
- Matching Data with DQS
Matching policies; Matching projects
Creating a matching policy, and de-duplicating data in a DQ project
De-duplicating data in a DQ Project
- Referencing Data Services in DQS
Cleansing data using reference data
Validate and geocode addresses
- Administering DQS
Installation; Configuration; Security
Install, configure and secure a DQS instance
Module 4
Final Assessment