Data Warehouse Concepts, Design, and Data Integration

Por: Coursera . en: ,

  • Data Warehouse Concepts and Architectures
    • Module 1 introduces the course and covers concepts that provide a context for the remainder of this course. In the first two lessons, you’ll understand the objectives for the course and know what topics and assignments to expect. In the remaining lessons, you will learn about historical reasons for development of data warehouse technology, learning effects, business architectures, maturity models, project management issues, market trends, and employment opportunities. This informational module will ensure that you have the background for success in later modules that emphasize details and hands-on skills.You should also read about the software requirements in the lesson at the end of module 1. I recommend that you try to install the software this week before assignments begin in week 2.
  • Multidimensional Data Representation and Manipulation
    • Now that you have conceptual background for data warehouse development, you’ll start using data warehouse tools! In module 2, you will learn about the multidimensional representation of a data warehouse used by business analysts. You’ll apply what you’ve learned in practice and graded problems using WebPivotTable, a web-based tool for manipulating pivot tables. At the end of this module, you will have solid background to communicate and assist business analysts who use a multidimensional representation of a data warehouse. To complete this module, you should proceed to the assignment and quiz involving WebPivotTable.
  • Data Warehouse Design Practices and Methodologies
    • This module emphasizes data warehouse design skills. Now that you understand the multidimensional representation used by business analysts, you are ready to learn about data warehouse design using a relational database. In practice, the multidimensional representation used by business analysts must be derived from a data warehouse design using a relational DBMS.You will learn about design patterns, summarizability problems, and design methodologies. You will apply these concepts to mini case studies about data warehouse design. At the end of the module, you will have created data warehouse designs based on data sources and business needs of hypothetical organizations.
  • Data Integration Concepts, Processes,and Techniques
    • Module 4 extends your background about data warehouse development. After learning about schema design concepts and practices, you are ready to learn about data integration processing to populate and refresh a data warehouse. The informational background in module 4 covers concepts about data sources, data integration processes, and techniques for pattern matching and inexact matching of text. Module 4 provides a context for the software skills that you will learn in module 5.
  • Architectures, Features, and Details of Data Integration Tools
    • Module 5 extends your background about data integration from module 4. Module 5 covers architectures, features, and details about data integration tools to complement the conceptual background in module 4. You will learn about the features of two open source data integration tools, Talend Open Studio and Pentaho Data Integration. You will use Pentaho Data Integration in a guided tutorial in preparation for a graded assignment involving Pentaho Data Integration. For the tutorial and assignment, you need to connect to a database server, Oracle, MySQL, or PostgreSQL. I recommend usage of PostgreSQL as it is easy to install on all platforms as well as widely used. In addition, PostgreSQL can be used in Course 3 and the capstone course. You should see Module 1 with installation Instructions for Pentaho Data Integration and these database servers.