Dashboard-Design/PythonETLPipelineProjects

PythonETLPipelineProjects 📊

Welcome to PythonETLPipelineProjects! This repository showcases my Python projects and DataCamp course progress on ETL/ELT data pipelines and data warehousing. It's a practical resource for:

  • Aspiring data engineers 🚀
  • Developers seeking Python data pipeline examples 🐍
  • Learners interested in data warehousing 📚

Contents 📋

  • Courses:
    • ETL and ELT in Python 🎓
    • Data Warehousing Concepts 🎓
  • Project:
    • Data Pipeline ETL with Python 🛠️

Courses Completed 📚

ETL and ELT in Python 🎓

  • Built performant data pipelines using Python libraries (e.g., pandas, json).
  • Covered extraction from structured/unstructured sources, transformation techniques, and loading data.
  • Explored pipeline validation, unit testing, and monitoring.
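
To make the extract, transform, validate, and load steps above concrete, here is a minimal sketch. It is not the course's or this project's actual code: the sample records, the `orders` table, and every function name are hypothetical, and it uses only the standard library (json, sqlite3, logging) so it runs without installing anything. In a real pipeline, pandas would typically take over the transform step.

```python
import json
import logging
import sqlite3

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("etl_sketch")

# Hypothetical raw input; a real pipeline would read this from a file or an API.
RAW_JSON = '[{"id": 1, "amount": "19.99"}, {"id": 2, "amount": null}, {"id": 3, "amount": "5.00"}]'


def extract(raw: str) -> list:
    """Parse the raw JSON text into a list of dict records."""
    return json.loads(raw)


def transform(records: list) -> list:
    """Drop records with a missing amount and cast the rest to float."""
    cleaned = []
    for record in records:
        if record.get("amount") is None:
            logger.warning("Skipping record %s: missing amount", record.get("id"))
            continue
        cleaned.append({"id": record["id"], "amount": float(record["amount"])})
    return cleaned


def validate(records: list) -> None:
    """Fail fast if the transformed data breaks basic expectations."""
    ids = [r["id"] for r in records]
    if len(ids) != len(set(ids)):
        raise ValueError("duplicate ids after transform")
    if any(r["amount"] < 0 for r in records):
        raise ValueError("negative amount after transform")


def load(records: list, conn: sqlite3.Connection) -> int:
    """Write records to a SQLite table and return the table's row count."""
    conn.execute("CREATE TABLE IF NOT EXISTS orders (id INTEGER PRIMARY KEY, amount REAL)")
    conn.executemany("INSERT INTO orders (id, amount) VALUES (:id, :amount)", records)
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]


def run_pipeline(raw: str, conn: sqlite3.Connection) -> int:
    """Run extract, transform, validate, and load in order."""
    records = transform(extract(raw))
    validate(records)
    loaded = load(records, conn)
    logger.info("Loaded %d rows", loaded)
    return loaded


if __name__ == "__main__":
    run_pipeline(RAW_JSON, sqlite3.connect(":memory:"))
```

Keeping each stage as a small function that takes data in and returns data out is what makes unit testing straightforward: a pytest test can call `transform` on a handful of hand-written records without touching a database.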

Data Warehousing Concepts 🎓

  • Studied data warehouses, data marts, and data lakes.
  • Compared Inmon's top-down and Kimball's bottom-up approaches.
  • Mastered Kimball's data modeling and the handling of slowly changing dimensions.
  • Understood OLAP vs. OLTP systems.
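
As an illustration of slowly changing dimensions, here is a minimal sketch of a Type 2 change, where history is kept by expiring the current row and appending a new version instead of overwriting it. The `CustomerRow` fields and the `apply_scd2` helper are hypothetical names chosen for this example, and an in-memory list stands in for a dimension table; this is not code from the course.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class CustomerRow:
    """One version of a customer in a hypothetical dimension table."""
    customer_id: int
    city: str
    valid_from: date
    valid_to: Optional[date] = None  # None marks the open-ended current version
    is_current: bool = True


def apply_scd2(dimension: List[CustomerRow], customer_id: int,
               new_city: str, change_date: date) -> None:
    """Apply a Type 2 change: expire the current row and append a new version."""
    current = next(
        (row for row in dimension
         if row.customer_id == customer_id and row.is_current),
        None,
    )
    if current is not None:
        if current.city == new_city:
            return  # Nothing changed, so no new version is needed.
        current.valid_to = change_date
        current.is_current = False
    dimension.append(CustomerRow(customer_id, new_city, valid_from=change_date))


# Usage: the customer moves from Lisbon to Porto, leaving two versions behind.
dim: List[CustomerRow] = []
apply_scd2(dim, 1, "Lisbon", date(2023, 1, 1))
apply_scd2(dim, 1, "Porto", date(2024, 6, 1))
for row in dim:
    print(row)
```

A Type 1 change would instead overwrite `city` in place and lose the Lisbon row; Type 2 trades extra rows for the ability to report against the value that was valid at any past date.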

Prerequisites 🛠️

  • Python 3.8+
  • Libraries: pandas, sqlalchemy, numpy, pytest (logging is part of the Python standard library and needs no install)
  • Optional: [Specify tools like PostgreSQL, Airflow, or cloud platforms if used]

Getting Started 🚀

  1. Clone the repository:
    git clone https://github.com/[YourUsername]/PythonETLPipelineProjects.git  
  2. Install dependencies:
    pip install -r requirements.txt  
  3. Run the project:
    python Project1/main.py  

License 📄

MIT License - see LICENSE.
