A curated list of docker-compose files prepared for testing data engineering tools, databases and open source libraries.
☆584Sep 17, 2023Updated 2 years ago
Alternatives and similar repositories for data-dockerfiles
Users that are interested in data-dockerfiles are comparing it to the libraries listed below.
- A repo to automatically generate and keep updated a series of Docker images through GitHub Actions.☆561Mar 24, 2026Updated last week
- Bidirectional port-forwarding for docker, podman and kubernetes☆295May 7, 2022Updated 3 years ago
- Never worry about losing your code. Written in Go☆330Aug 28, 2022Updated 3 years ago
- A collection of vpns☆109Sep 28, 2022Updated 3 years ago
- This repository contains a collection of notebooks and scripts that demonstrate the essential Python skills for machine learning operatio…☆12Jul 16, 2023Updated 2 years ago
- This repository hosts materials for the Docker for Data Engineers workshop, offering hands-on exercises and resources tailored for data e…☆17May 23, 2024Updated last year
- HAnoProxY is a DNS server offering proxyless high availability and load balancing for applications☆20Jul 26, 2022Updated 3 years ago
- Pytorch YoloV3 implementation from scratch☆14Mar 6, 2022Updated 4 years ago
- Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding)☆64Mar 9, 2024Updated 2 years ago
- FastAPI CLI is a command-line tool designed to help developers quickly generate a structured project file system for FastAPI applications…☆12Feb 3, 2025Updated last year
- The tools and sample needed to learn the Docker☆503Dec 31, 2023Updated 2 years ago
- Hundreds of Offensive and Useful Docker Images for Network Intrusion. The name says it all.☆1,248Nov 21, 2025Updated 4 months ago
- Configuration and schema sync for Metabase from Python☆19Mar 23, 2023Updated 3 years ago
- Run popular commandline tools within docker☆1,275Aug 31, 2023Updated 2 years ago
- Unbreakable Security, Guaranteed and Exactly Once Delivery☆28Dec 1, 2025Updated 3 months ago
- Machine Learning in Snowflake☆23Aug 21, 2019Updated 6 years ago
- A collection of free and open-source resources related to Iran, including maps, vector files, GeoJSON data, and more. This repository aim…☆22Oct 26, 2024Updated last year
- Get eBPF programs running from the cloud to the kernel in 1 line of bash☆1,297Apr 17, 2025Updated 11 months ago
- Build data pipelines, the easy way 🛠️☆4,138Jun 6, 2023Updated 2 years ago
- This course is designed to provide learners with the fundamental skills needed for data engineering using Python. The objective is to int…☆26Aug 15, 2024Updated last year
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,799Updated this week
- Write python locally, execute SQL in your data warehouse☆268Jul 5, 2022Updated 3 years ago
- Download and set bing daily wallpapers for Raspberry pi Desktop☆12Dec 6, 2024Updated last year
- example pipelines for deploying dbt via Azure DevOps pipelines☆21Apr 2, 2021Updated 4 years ago
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,623May 29, 2025Updated 10 months ago
- Script to train a German n-gram Language Model on articles of Wikipedia☆14Oct 20, 2018Updated 7 years ago
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Jul 7, 2021Updated 4 years ago
- Always know what to expect from your data.☆11,301Updated this week
- Free Persian Word Level OCR Dataset☆24Aug 1, 2020Updated 5 years ago
- Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Jo…☆39,324Mar 19, 2026Updated last week
- ☆30Nov 17, 2022Updated 3 years ago
- A tool for finding memory leaks in web apps☆4,629Aug 12, 2025Updated 7 months ago
- A web app built with Flask to stream video feed from any IP camera to the users of a local network☆14Dec 23, 2021Updated 4 years ago
- A comprehensive Spark guide collated from multiple sources that can be referred to learn more about Spark or as an interview refresher.☆688Apr 22, 2022Updated 3 years ago
- Golang Twitter Library☆27Jun 22, 2022Updated 3 years ago
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da…☆114Sep 21, 2023Updated 2 years ago
- ☆276Mar 23, 2026Updated last week
- ZincSearch. A lightweight alternative to elasticsearch that requires minimal resources, written in Go.☆17,787Jan 23, 2026Updated 2 months ago
- Sample project to get started with dbt-power-user vscode extension using dev-container☆12Apr 5, 2024Updated last year