Data-engineering

This repository contains projects done during my Udacity nano degree course.

Projects

Data Modeling with PostgreSQL

Built a database schema and ETL pipeline for this analysis. I tested the database and ETL pipeline by running queries given to me by the analytics team from Sparkify and compared my results with their expected results.

Data Warehouse with AWS Redshift

Built an AWS Redshift database with tables designed to optimize queries.

Data Lake with Spark

Created ETL pipelines that extracts data from AWS S3, processes the data using Spark, and loads the data back into S3 as a set of dimensional tables.

Data Pipelines with Airflow

Built ETL pipelines using Airflow.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
Project-1a Data Modeling with PostgreSQL		Project-1a Data Modeling with PostgreSQL
Project-2 Data Warehouse with AWS Redshift		Project-2 Data Warehouse with AWS Redshift
Project-3 Data Lake with Spark		Project-3 Data Lake with Spark
Project-4 Data Pipelines with Airflow		Project-4 Data Pipelines with Airflow
.DS_Store		.DS_Store
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data-engineering

Projects

Data Modeling with PostgreSQL

Data Warehouse with AWS Redshift

Data Lake with Spark

Data Pipelines with Airflow

About

Uh oh!

Releases

Packages

Languages

segunadelowo/data-engineering

Folders and files

Latest commit

History

Repository files navigation

Data-engineering

Projects

Data Modeling with PostgreSQL

Data Warehouse with AWS Redshift

Data Lake with Spark

Data Pipelines with Airflow

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages