KennethanCeyer / awesome-data-pipeline
Awesome list for datapipeline
☆35 Updated 2 years ago
Alternatives and similar repositories for awesome-data-pipeline
Users who are interested in awesome-data-pipeline are comparing it to the repositories listed below.
- A curated list of awesome DataOps tools ☆211 Updated 5 months ago
- Awesome list of dataops products, open source and resources ☆24 Updated 3 years ago
- Full stack data engineering tools and infrastructure set-up ☆57 Updated 4 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for … ☆139 Updated 5 years ago
- A list of free datasets that provide streaming data ☆426 Updated last year
- New generation opensource data stack ☆76 Updated 3 years ago
- This is a repo with links to everything you'd ever want to learn about data engineering ☆10 Updated last year
- A curated list of open source tools used in analytics platforms and data engineering ecosystem ☆406 Updated 9 months ago
- Apache Spark Guide ☆33 Updated 3 years ago
- Dockerizing an Apache Spark Standalone Cluster ☆43 Updated 3 years ago
- How to build an awesome data engineering team ☆100 Updated 6 years ago
- Apache Airflow Guide ☆31 Updated last year
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀 ☆38 Updated 3 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster ☆28 Updated 3 years ago
- A real-time event pipeline around Kafka Ecosystem for Chicago Transit Authority. ☆32 Updated 2 years ago
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A… ☆41 Updated 3 years ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a… ☆44 Updated last year
- This is a demo streaming project simulating a music streaming service. ☆34 Updated last year
- Spark data pipeline that processes movie ratings data. ☆30 Updated this week
- Challenge Data Engineer ☆25 Updated 3 years ago
- Open Data Stack Projects: Examples of End to End Data Engineering Projects ☆91 Updated 2 years ago
- A curated list of awesome blogs, videos, tools and resources about Data Contracts ☆180 Updated last year
- ☆75 Updated 2 weeks ago
- 📙 Awesome Data Catalogs and Observability Platforms. ☆952 Updated 3 months ago
- Auto-generated Diagrams from Airflow DAGs. 🔮 🪄 ☆354 Updated this week
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic. ☆91 Updated last year
- ☆12 Updated 3 years ago
- Data Glossary 🧠: An interactive digital garden for deeper data exploration. Learn through a graph and backlinks, enabling layered knowle… ☆113 Updated 2 years ago
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture ☆124 Updated last month
- All of my recommendations for aspiring engineers in a single place, coming from various areas of interest. ☆158 Updated 3 weeks ago