π¦ Batch data pipeline with Airflow, DuckDB, Delta Lake, Trino, MinIO, and Metabase. Full observability and data quality.
β89Nov 5, 2025Updated 7 months ago
Alternatives and similar repositories for batch-data-pipeline
Users that are interested in batch-data-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β12Oct 12, 2023Updated 2 years ago
- Question and Answer application using AWS Bedrock, AWS ECS, Langchain, Qdrant, and FastAPIβ15Feb 27, 2024Updated 2 years ago
- β15Mar 29, 2024Updated 2 years ago
- β12May 28, 2024Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Supersetβ49Apr 5, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An Objective-C library for uploading shots to Dribbble.β13Mar 27, 2012Updated 14 years ago
- Use MobileNet SSD and openCV to detect and count car on roadβ11Jan 13, 2020Updated 6 years ago
- A python script to convert your youtube URL to an mp3 file and download it to the same directory as the .py file.β10May 20, 2025Updated last year
- β10Jul 19, 2020Updated 5 years ago
- Deploy a complete data stack in just a couple of minutes.β15Mar 6, 2024Updated 2 years ago
- β13Sep 23, 2023Updated 2 years ago
- Practice notebooks for NumPy, Pandas, matplotlib, basic machine learning etc.β13Nov 20, 2017Updated 8 years ago
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.β18Jun 19, 2022Updated 3 years ago
- A sophisticated exploration of dbt macro capabilities, pushing the boundaries of what's possible with dbt's macro system.β18Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore,β¦β48Oct 14, 2024Updated last year
- β16Apr 18, 2025Updated last year
- learning-by-doing data model built with dbt-coreβ17Apr 10, 2026Updated 2 months ago
- um its my portfolio?β16Updated this week
- β11Feb 24, 2022Updated 4 years ago
- β15Updated this week
- β17Nov 27, 2025Updated 6 months ago
- CMU 15-712 lecture slidesβ11Jan 6, 2020Updated 6 years ago
- This workshop will familiarize you with some of the key steps towards building an autonomous driving data lake and extracting images fromβ¦β10Jul 12, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- http://archive.ics.uci.edu/ml/index.htmlβ12Jan 25, 2020Updated 6 years ago
- β18May 27, 2025Updated last year
- Create an Anime database containing all the Anime currently available on the website, which includes: 'Anime Title', 'Description', 'Cβ¦β11Jun 10, 2020Updated 6 years ago
- 5 Claude Code skills for product managers. Drop them in your .claude/skills/ folder and go.β84Mar 4, 2026Updated 3 months ago
- β16Apr 26, 2020Updated 6 years ago
- β22Mar 15, 2011Updated 15 years ago
- Miscellaneous codes and writings for MLOpsβ15Apr 8, 2026Updated 2 months ago
- Interactive web-based dashboard to manage traffic flow using YOLOX, DeepSORTβ12Jul 30, 2022Updated 3 years ago
- Create and Run π Dotfiles projects for Windows 10/11β23Jan 26, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Machine Learning Model and Deployment for Classification of Mango Varietiesβ10Dec 22, 2022Updated 3 years ago
- distributed computing toolkit in rustβ22Sep 21, 2018Updated 7 years ago
- A local-first, terminal-based password manager built for people who care about security, simplicity, and controlβ39Dec 31, 2025Updated 5 months ago
- Using data from IBM Watson, descriptive and predictive analytics using Python and tableauβ12Dec 23, 2017Updated 8 years ago
- Apache Airflow advanced functionalities examplesβ21Mar 22, 2024Updated 2 years ago
- Create a chatbot that provides responses in Vietnamese, focusing on the products offered by a flower shopβ11Nov 14, 2024Updated last year
- Predicting Car Prices with FastAPI, Streamlit, MLflow, Kafka, and Debezium: A Practical Demonstrationβ25Nov 12, 2024Updated last year