Building Batch Data Pipelines on GCP

Por: Coursera . en: , ,

  • Introduction
    • In this module, we introduce the course and agenda
  • Introduction to Batch Data Pipelines
    • This module reviews different methods of data loading: EL, ELT and ETL and when to use what
  • Executing Spark on Dataproc
    • This module shows how to run Hadoop on Dataproc, how to leverage Cloud Storage, and how to optimize your Dataproc jobs.
  • Manage Data Pipelines with Cloud Data Fusion and Cloud Composer
    • This module shows how to manage data pipelines with Cloud Data Fusion and Cloud Composer.
  • Serverless Data Processing with Dataflow
    • This module covers using Dataflow to build your data processing pipelines
  • Summary
    • This module reviews the topics covered in this course

Plataforma