Portada » Data Engineering Capstone Project

Data Engineering Capstone Project

Curso impartido por IBM vía Coursera

Acerca del Curso

  • Data Platform Architecture and OLTP Database
    • In this module, you will design a data platform that uses MySQL as an OLTP database. You will be using MySQL to store the OLTP data.
  • Querying Data in NoSQL Databases
    • In this module, you will design a data platform that uses MongoDB as a NoSQL database. You will use MongoDB to store the e-commerce catalog data.
  • Build a Data Warehouse
    • In this module you will design and implement a data warehouse and you will then generate reports from the data in the data warehouse.
  • Data Analytics
    • In this module, you will assume the role of a data engineer at an e-commerce company. Your company has finished setting up a data warehouse. Now you are assigned the responsibility to design a reporting dashboard that reflects the key metrics of the business.
  • ETL & Data Pipelines
    • In this module, you will use the given python script to perform various ETL operations that move data from RDBMS to NoSQL, NoSQL to RDBMS, and from RDBMS, NoSQL to the data warehouse. You will write a pipeline that analyzes the web server log file, extracts the required lines and fields, transforms and loads data.
  • Big Data Analytics with Spark
    • In this module, you will use the data from a webserver to analyse search terms. You will then load a pretrained sales forecasting model and predict the sales forecast for a future year.
  • Final Submission and Peer Review
    • In this final module you will complete your submission of screenshots from the hands-on labs for your peers to review. Once you have completed your submission you will then review the submission of one of your peers and grade their submission.

 

Curso en Coursera
Universidad: IBM
Plataforma: Coursera
Precio: Gratis