Data Engineering Roadmap
---
NoSQL Databases:
- MongoDB (Document Store)
- Redis (Key-Value Store)
- Apache Cassandra (Wide-Column Store)
🛠 **Hands-on:**
- Set up PostgreSQL & MongoDB locally
- Design a simple database schema
---
🛠 **Hands-on:**
- Design a star schema for an e-commerce dataset
- Load & query data in BigQuery
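
As a starting point for the star schema exercise, here is a minimal sketch that uses Python's built-in `sqlite3` as a local stand-in for a warehouse such as BigQuery. The table names (`dim_customer`, `dim_product`, `dim_date`, `fact_sales`), the columns, and the sample rows are all illustrative assumptions, not a prescribed design.

```python
import sqlite3

# In-memory SQLite stands in for the warehouse; all names and rows are made up.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_customer (customer_key INTEGER PRIMARY KEY, name TEXT, country TEXT);
CREATE TABLE dim_product  (product_key  INTEGER PRIMARY KEY, name TEXT, category TEXT);
CREATE TABLE dim_date     (date_key     INTEGER PRIMARY KEY, full_date TEXT, month INTEGER, year INTEGER);

-- The fact table holds measures plus one foreign key per dimension.
CREATE TABLE fact_sales (
    sale_id      INTEGER PRIMARY KEY,
    customer_key INTEGER REFERENCES dim_customer(customer_key),
    product_key  INTEGER REFERENCES dim_product(product_key),
    date_key     INTEGER REFERENCES dim_date(date_key),
    quantity     INTEGER,
    amount       REAL
);
""")

conn.executemany("INSERT INTO dim_customer VALUES (?, ?, ?)",
                 [(1, "Ada", "UK"), (2, "Lin", "SG")])
conn.executemany("INSERT INTO dim_product VALUES (?, ?, ?)",
                 [(1, "Laptop", "Electronics"),
                  (2, "Headphones", "Electronics"),
                  (3, "Novel", "Books")])
conn.executemany("INSERT INTO dim_date VALUES (?, ?, ?, ?)",
                 [(20240105, "2024-01-05", 1, 2024),
                  (20240210, "2024-02-10", 2, 2024)])
conn.executemany("INSERT INTO fact_sales VALUES (?, ?, ?, ?, ?, ?)",
                 [(1, 1, 1, 20240105, 1, 1200.0),
                  (2, 2, 2, 20240105, 2, 150.0),
                  (3, 1, 3, 20240210, 3, 45.0),
                  (4, 2, 3, 20240210, 1, 15.0)])

# A typical star-schema query: join the fact table to a dimension and aggregate.
revenue_by_category = conn.execute("""
    SELECT p.category, SUM(f.amount) AS revenue
    FROM fact_sales AS f
    JOIN dim_product AS p ON p.product_key = f.product_key
    GROUP BY p.category
    ORDER BY p.category
""").fetchall()

print(revenue_by_category)
```

The same shape (one central fact table, several small dimension tables) should carry over to BigQuery, though data types and loading steps will differ there.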
---
🛠 **Hands-on:**
- Build an Airflow DAG to extract data from an API and store it in a database
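
Before wiring anything into Airflow, it can help to write the extract and load steps as plain functions. The sketch below does only that: it does not import Airflow, the URL `https://api.example.com/orders` and the `orders` table are hypothetical, and SQLite stands in for the target database. In a DAG, each function would typically become its own task.

```python
import json
import sqlite3
from typing import Callable
from urllib.request import urlopen

API_URL = "https://api.example.com/orders"  # hypothetical endpoint


def fetch_bytes(url: str) -> bytes:
    """Download the raw response body over HTTP."""
    with urlopen(url, timeout=10) as resp:
        return resp.read()


def extract(url: str, fetch: Callable[[str], bytes] = fetch_bytes) -> list:
    """Extract step: fetch JSON and parse it into a list of dicts."""
    return json.loads(fetch(url))


def load(rows: list, conn: sqlite3.Connection) -> int:
    """Load step: upsert rows so a retried run does not duplicate data."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders (id INTEGER PRIMARY KEY, total REAL)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO orders (id, total) VALUES (:id, :total)", rows
    )
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]


# Demo with a stubbed fetch so the sketch runs without network access.
def stub_fetch(url: str) -> bytes:
    return b'[{"id": 1, "total": 9.5}, {"id": 2, "total": 20.0}]'


conn = sqlite3.connect(":memory:")
loaded = load(extract(API_URL, fetch=stub_fetch), conn)
loaded_again = load(extract(API_URL, fetch=stub_fetch), conn)  # simulated retry
print(loaded, loaded_again)
```

Passing `fetch` as a parameter keeps the extract step testable offline, and the upsert keeps the load step safe to rerun, which matters because schedulers retry failed tasks.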
---
🛠 **Hands-on:**
- Stream real-time tweets using Kafka and process them with Spark
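
The core of the processing step is usually a windowed aggregation. The sketch below shows that idea in plain Python only: it uses no Kafka or Spark APIs, and the event format `(timestamp_seconds, text)`, the 60-second window, and the sample events are assumptions for illustration. In the actual exercise, Kafka would deliver the events and Spark would perform the grouping.

```python
from collections import Counter

WINDOW_SECONDS = 60  # assumed tumbling-window size


def window_start(ts: int) -> int:
    """Map an event timestamp to the start of its tumbling window."""
    return ts - ts % WINDOW_SECONDS


def count_hashtags(events):
    """Count hashtags per (window_start, hashtag) pair."""
    counts = Counter()
    for ts, text in events:
        for word in text.split():
            if word.startswith("#"):
                counts[(window_start(ts), word.lower())] += 1
    return counts


# Made-up sample events: (timestamp in seconds, message text).
events = [
    (0, "Loving #Kafka"),
    (30, "#kafka and #Spark"),
    (65, "more #spark"),
]
counts = count_hashtags(events)
for (start, tag), n in sorted(counts.items()):
    print(start, tag, n)
```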
---
🛠 **Hands-on:**
- Set up an AWS Glue job to process data from S3 and load it into Redshift
---
🛠 **Hands-on:**
- Create a CI/CD pipeline for deploying an Airflow DAG
---
#### Intermediate:
- Process streaming Twitter data with Kafka & Spark
- Implement a data warehouse using BigQuery
#### Advanced:
- Build a full-scale real-time analytics pipeline
- Design a cloud-based data lakehouse using AWS
---