Collection of dockerized ETL jobs managed by data engineering.
☆21 · Updated this week
Alternatives and similar repositories for docker-etl
Users who are interested in docker-etl compare it to the repositories listed below.
- ETL jobs for Firefox Telemetry ☆29 · Nov 7, 2025 · Updated 3 months ago
- Bigquery ETL ☆329 · Feb 21, 2026 · Updated last week
- download all oral & spotlight papers from neurips, iclr, icml or any openreview conference ☆21 · Dec 6, 2025 · Updated 2 months ago
- 💀 Periodically shakes your mouse pointer ☆13 · Jan 21, 2024 · Updated 2 years ago
- A compilation of main commands for scikit-learn with examples ☆11 · Apr 4, 2023 · Updated 2 years ago
- Azure DevOps Deployment Tasks for Azure Data Factory objects ☆40 · Jun 3, 2025 · Updated 8 months ago
- ADK apps for Product Engineers ☆28 · Jan 8, 2026 · Updated last month
- Data Mining project 2020/2021 @ University of Pisa ☆13 · Mar 9, 2021 · Updated 4 years ago
- DEPRECATED: Mozilla Build Metadata Service ☆13 · Jun 27, 2019 · Updated 6 years ago
- A no-dependency library to send standardized events to observability and data platforms. Based on plugins, Stratum enables the cataloging… ☆26 · Feb 8, 2026 · Updated 3 weeks ago
- Blueprints repo, new samples, ARM Templates for Blueprints, exported/importable Blueprints ☆10 · Jan 9, 2025 · Updated last year
- yasgi is a tiny ASGI framework ✨ ☆10 · Apr 29, 2024 · Updated last year
- Code samples for an Ignite conference presentation on the topic of Automating Azure SQL Data Warehouse ☆11 · Mar 21, 2023 · Updated 2 years ago
- GraphQL Client for Pythonistas ☆10 · Mar 26, 2020 · Updated 5 years ago
- (Deprecated) Ansible roles to configure assorted components for an Ubuntu VM or container configured with https://github.com/galaxyproje… ☆11 · Nov 9, 2024 · Updated last year
- Example DAGs for Airflow 2.9 ☆11 · May 20, 2024 · Updated last year
- Functional todo list app demonstrating DuckDB WASM OPFS persistence ☆21 · Oct 20, 2025 · Updated 4 months ago
- Quantifying and reporting uncertainty in drug discovery predictions with probabilistic models ☆11 · Apr 29, 2022 · Updated 3 years ago
- 🌺 Petal - Flask, for gRPC services. ☆12 · Apr 21, 2019 · Updated 6 years ago
- A Zoom utility for the terminal. ☆10 · May 4, 2020 · Updated 5 years ago
- A utility to manage FIDO2 devices ☆15 · Jun 24, 2024 · Updated last year
- 🤖 An autonomous AI agent system that collaboratively designs, implements, and manages Apache Airflow DAGs through natural language inter… ☆28 · Aug 6, 2025 · Updated 6 months ago
- agogosml is a flexible data processing pipeline that addresses the common need for operationalizing ML models at scale ☆34 · May 3, 2019 · Updated 6 years ago
- Spark bindings for Mozilla Telemetry ☆15 · Jan 22, 2026 · Updated last month
- MLflow logging for PyMC ☆14 · Aug 31, 2025 · Updated 6 months ago
- https://www.distributedpython.com/2018/05/01/unit-testing-celery-tasks ☆13 · Jun 26, 2018 · Updated 7 years ago
- Repository used to maintain group ACLs used by Kubeflow developers ☆18 · Feb 17, 2026 · Updated last week
- Stricter subset of dplyr ☆13 · Dec 8, 2019 · Updated 6 years ago
- A small data lake meant for solitary use ☆16 · Jan 28, 2025 · Updated last year
- ☆14 · Jan 11, 2023 · Updated 3 years ago
- Elasticsearch, Logstash, Kibana scripts used for SharePoint ULS indexing. ☆13 · Nov 27, 2015 · Updated 10 years ago
- Alpha for notify API. Sends emails/sms/printed content on behalf of government. ☆15 · Feb 8, 2016 · Updated 10 years ago
- Hands-On Data Warehousing with Azure Data Factory, published by Packt ☆15 · Jan 18, 2023 · Updated 3 years ago
- ☆15 · Oct 27, 2022 · Updated 3 years ago
- An interface to libseccomp using ctypes. API compatible with libseccomp's Python bindings. ☆14 · Jun 12, 2021 · Updated 4 years ago
- Process very large Stan models efficiently ☆14 · Dec 5, 2025 · Updated 2 months ago
- A list of MCP services for popular data tools ☆20 · Jul 14, 2025 · Updated 7 months ago
- GitHub Action That Submits Argo Workflows For Execution on Your GKE Cluster ☆16 · Jan 25, 2021 · Updated 5 years ago
- ARCHIVED Generate Code from BNF Grammars ☆12 · May 10, 2022 · Updated 3 years ago