Skip to content

shlbatra/Udacity_DataEngineering_Projects

Repository files navigation

Data Engineering Nanodegree

You can check more about the nanodegree program out here: https://www.udacity.com/course/data-engineer-nanodegree--nd027

Purpose of this repository

Here you can take a look at all my exercise notebooks made throughout the nanodegree courses.

Also, you encounter the list of the projects developed throughout this course down below.

Courses Projects

1. Data Modeling Course

Project 1: Data Modeling with Postgres: Sparkify song play logs ETL process
Project 2: Data Modeling with Apache Cassandra: Sparkify song play logs ETL process

2. Cloud Data Warehouses

Project 3: Data Warehouse with AWS Redshift: Sparkify - ETL process of song play events

3. Data Lakes with Spark

Project 4: Sparkify's Data Lake ELT process

4. Data Pipelines with Airflow

Project 5: Sparkify's Event Logs Data Pipeline

5. Capstone Project

Travel to US: a simple and unified dataset with immigration from around the globe to US.

About

Projects using technologies such as PostGres, S3, Spark, Cassandra & Airflow

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published