angelddaz / de-challenges
Project based learning for Data Engineering fundamentals.
☆13 Updated 4 years ago
Alternatives and similar repositories for de-challenges
Users that are interested in de-challenges are comparing it to the libraries listed below
- pyspark dataframe made easy ☆16 Updated 4 years ago
- 🐍💨 Airflow tutorial for PyCon 2019 ☆87 Updated 3 years ago
- Code and notebooks containing my experiments in data science, EDA, visualization, and machine learning ☆27 Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up ☆57 Updated 4 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups ☆18 Updated 7 years ago
- ☆31 Updated 2 years ago
- All the code related to building my own data lake ☆21 Updated 2 years ago
- A repo to track data engineering projects ☆13 Updated 3 years ago
- Big Data Demystified meetup and blog examples ☆31 Updated last year
- Techniques for Scraping the Web in Python ☆26 Updated 7 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics … ☆20 Updated 4 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for … ☆140 Updated 5 years ago
- Learn data science with Python ☆25 Updated last year
- Content related to Mastering Postgresql along with videos. ☆18 Updated 4 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course ☆18 Updated 5 years ago
- Jupyter Notebook and Python business intelligence tools and techniques. [Raw upload] ☆85 Updated 2 years ago
- Code snippets and tools published on the blog at lifearounddata.com ☆12 Updated 5 years ago
- Best practices for engineering ML pipelines. ☆36 Updated 3 years ago
- The goal of this project is to build an RL-based algorithm that can help cab drivers maximize their profits by improving their decision-m … ☆14 Updated 4 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling ☆104 Updated 5 years ago
- Dashboard in Python with Jupyter Notebook ☆44 Updated 3 years ago
- Simple samples for writing ETL transform scripts in Python ☆24 Updated this week
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market ☆58 Updated 3 years ago
- Template for Data Engineering and Data Pipeline projects ☆115 Updated 2 years ago
- Challenge for those applying to the Software Engineer, Big Data position ☆35 Updated 14 years ago
- Airflow Tutorials ☆25 Updated 4 years ago
- Material for Talk Python Training course on Getting Started with Dask. ☆30 Updated 3 years ago
- Runnable e-commerce mini data warehouse based on Python, PostgreSQL & Metabase, template for new projects ☆29 Updated 4 years ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database ☆14 Updated 4 years ago
- Data lake, data warehouse on GCP ☆57 Updated 3 years ago