Portada » Building Batch Data Pipelines on Google Cloud

Building Batch Data Pipelines on Google Cloud

Curso impartido por Johns Hopkins University vía Coursera

Acerca del Curso

  • Introduction
    • In this module, we introduce the course and agenda
  • Introduction to Building Batch Data Pipelines
    • This module reviews different methods of data loading: EL, ELT and ETL and when to use what
  • Executing Spark on Dataproc
    • This module shows how to run Hadoop on Dataproc, how to leverage Cloud Storage, and how to optimize your Dataproc jobs.
  • Serverless Data Processing with Dataflow
    • This module covers using Dataflow to build your data processing pipelines
  • Manage Data Pipelines with Cloud Data Fusion and Cloud Composer
    • This module shows how to manage data pipelines with Cloud Data Fusion and Cloud Composer.
  • Course Summary
    • Course Summary

 

Curso en Coursera
Universidad: Johns Hopkins University
Plataforma: Coursera
Precio: Gratis