Building a Datalake with Your Data

Overview

In this workshop, we will be building a serverless data pipeline using your own data. Starting with your sample data stored in Amazon S3, we will go through various parts of the workshop focusing on transforming, analyzing, and visualizing your data.

Upon completion of the workshop, you will have a solid foundation to further develop your data pipeline and generate more insights. We will leverage AWS Glue for data cataloguing and running ETL on data in the data lake. Amazon Athena will be used for querying data in the data lake and Amazon QuickSight for data visualization.

In this workshop, we will be using the Singapore Region (ap-southeast-1), but you may choose a different region as you prefer.

Preparation
Data Preparation
Ingestion with Glue
Building the data pipeline
Querying with Athena
Visualization with QuickSight
Resource Cleanup

Building a Datalake with Your Data

Overview

Contents