irbigdata / data-dockerfiles
A curated list of docker-compose files prepared for testing data engineering tools, databases, and open source libraries.
☆584 · Updated 2 years ago
Alternatives and similar repositories for data-dockerfiles
Users who are interested in data-dockerfiles are comparing it to the repositories listed below.
- The tools and sample needed to learn the Docker ☆502 · Updated 2 years ago
- A comprehensive Spark guide collated from multiple sources that can be referred to learn more about Spark or as an interview refresher. ☆687 · Updated 3 years ago
- Auto-generated Diagrams from Airflow DAGs. 🔮 🪄 ☆355 · Updated last week
- Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface. ☆2,229 · Updated last week
- Docker Makefiles ☆94 · Updated last year
- Run popular commandline tools within docker ☆1,274 · Updated 2 years ago
- Accumulated knowledge and experience in the field of Data Engineering ☆871 · Updated 3 years ago
- re_data - fix data issues before your users & CEO would discover them 😊 ☆1,569 · Updated last year
- Compare tables within or across databases ☆2,993 · Updated last year
- A curated list of awesome DataOps tools ☆225 · Updated last month
- The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-host… ☆2,242 · Updated this week
- Write python locally, execute SQL in your data warehouse ☆268 · Updated 3 years ago
- The easiest way to run Airflow locally, with linting & tests for valid DAGs and Plugins. ☆258 · Updated 4 years ago
- 🐳 The stupidly simple CLI workspace for your data warehouse. ☆728 · Updated 2 years ago
- First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business… ☆1,374 · Updated 3 weeks ago
- A plugin for Apache Airflow that allows you to edit DAGs in browser ☆458 · Updated 2 weeks ago
- New Generation Opensource Data Stack Demo ☆454 · Updated 3 years ago
- Selfhosted tech starter pack for development of new project or startup ☆1,248 · Updated 2 years ago
- Metrics Observability & Troubleshooting ☆328 · Updated last year
- A complete development environment setup for working with Airflow ☆129 · Updated 2 years ago
- Data Contracts engine for the modern data stack. https://www.soda.io ☆2,281 · Updated this week
- Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to b… ☆805 · Updated 3 years ago
- A cookbook with the best practices to working with kubernetes. ☆1,472 · Updated last month
- Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da… ☆113 · Updated 2 years ago
- Collection covers kubernetes exercises categorized topics-wise and referred back to the individual Kubernetes certification exams. ☆268 · Updated last year
- Sync DAG changes from Git to Airflow ☆70 · Updated 5 months ago
- Grafana dashboards and StatsD exporter config for Airflow monitoring ☆292 · Updated last year
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. ☆506 · Updated 3 months ago
- A curated collection of helpful SQL queries and functions, maintained by Count. ☆208 · Updated 4 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆225 · Updated 9 months ago