Skip to content

Welcome to Amazon EMR Train-The-Trainer Workshop

This workshop contains exercises for the 3-day Amazon EMR Workshop Training.

Exercises

Part 1

  1. Launch EMR cluster with Managed Scaling and Fleets
  2. Run Spark workloads on Amazon EMR
  3. Orchestrate EMR Steps using AWS Step Functions

Part 2

  1. Amazon EMR Studio Dive Deep
  2. Orchestrate notebook pipelines using Amazon EMR Studio and Amazon MWAA
  3. Integrate with Amazon Sagemaker Studio

Part 3

  1. Build transactional data lakes using Apache Hudi
  2. Build transactional data lakes using Apache Iceberg

Part 4

  1. Run Spark workloads using Amazon EMR on EKS
  2. Run Spark and Hive workloads on EMR Serverless (preview)