Course subjects
Module 1: Introduction to data lakes
Describe the value of data lakes
Compare data lakes and data warehouses
Describe the components of a data lake
Recognise common architectures built on data lakes
Module 2: Data ingestion, cataloging, and preparation
Describe the relationship between data lake storage and data ingestion
Describe AWS Glue crawlers and how they are used to create a data catalog
Identify data formatting, partitioning, and compression options for efficient storage and querying
Module 3: Data processing and analysis
Recognise how data processing applies to a data lake
Use AWS Glue to process data within a data lake
Describe how to use Amazon Athena to analyse data in a data lake
Lab 1: Building a data lake with AWS Lake Formation
Module 4: Building a data lake with AWS Lake Formation
Describe the features and benefits of AWS Lake Formation
Use AWS Lake Formation to create a data lake
Understand the AWS Lake Formation security model
Lab 2: Build a data lake using AWS Lake Formation
Module 5: Additional Lake Formation configurations
Explain the built-in blueprints available to create and populate a new data lake in Lake Formation
Describe methods for applying advanced permissions to secure data access and workflow
Describe fine-grained row-level and cell-level access control
Explain the Lake Formation tag-based access control mechanism and the use cases for named access control versus tag-based access control
Describe the access flow that enforces fine-grained access policies on both catalog metadata and the underlying data resources for analytics services connecting to Lake Formation
Module 6: Modern data architecture
Explain the capabilities of a modern data architecture: scalable data lakes, purpose-built analytics services, seamless data movement, unified governance, and performance and cost-effectiveness
Describe the typical data movement patterns within a modern data architecture: inside-out, outside-in, around the perimeter, and sharing across
Describe the focus on building and maintaining data products as a service
Describe a typical data mesh architecture using Lake Formation and the key enablers supporting this methodology
Lab 3: Building and publishing a data product in Lake Formation
Module 7: Course wrap-up
Please note: This is an emerging technology course. The course outline is subject to change as needed.