irbigdata / data-dockerfiles
A curated list of docker-compose files prepared for testing data engineering tools, databases, and open source libraries.
☆577 Updated last year
Alternatives and similar repositories for data-dockerfiles:
Users who are interested in data-dockerfiles are comparing it to the libraries listed below:
- The tools and samples needed to learn Docker ☆497 Updated last year
- A comprehensive Spark guide collated from multiple sources that can be consulted to learn more about Spark or as an interview refresher. ☆658 Updated 2 years ago
- Write Python locally, execute SQL in your data warehouse ☆270 Updated 2 years ago
- The Data Explorer gives you fast, safe access to data stored in Cassandra, Dynomite, and Redis. ☆431 Updated last year
- Use SQL to build ELT pipelines on a data lakehouse. ☆284 Updated 2 years ago
- A curated collection of helpful SQL queries and functions, maintained by Count. ☆201 Updated 3 years ago
- Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding) ☆50 Updated 10 months ago
- A curated list of awesome DataOps tools ☆169 Updated 3 months ago
- Self-hosted tech starter pack for developing a new project or startup ☆1,226 Updated last year
- The Open-Source Enterprise Data Platform in a single Portal ☆227 Updated this week
- ☆253 Updated 3 weeks ago
- Delta-Lake, ETL, Spark, Airflow ☆45 Updated 2 years ago
- New-generation open source data stack ☆65 Updated 2 years ago
- Quickstart for any service ☆136 Updated this week
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, .... ☆74 Updated this week
- Sync DAG changes from Git to Airflow ☆52 Updated 4 months ago
- A data engineering project (Twitter monitor app) ☆78 Updated 2 years ago
- New Generation Opensource Data Stack Demo ☆420 Updated last year
- re_data - fix data issues before your users & CEO discover them 😊 ☆1,563 Updated 8 months ago
- Example repo for full end-to-end PySpark testing via docker-compose ☆30 Updated last year
- Resources for video demonstrations and blog posts related to DataOps on AWS ☆172 Updated 2 years ago
- Accumulated knowledge and experience in the field of Data Engineering ☆865 Updated 2 years ago
- A list of remote-friendly or fully remote companies that target Iranian talent. ☆337 Updated 2 years ago
- Grafana dashboards and StatsD exporter config for Airflow monitoring ☆267 Updated 10 months ago
- A collection of Kubernetes exercises categorized by topic and mapped to the individual Kubernetes certification exams. ☆263 Updated 2 months ago
- 🐺 Deploy Databases and Services Easily for Development and Testing Pipelines. ☆725 Updated 3 weeks ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆204 Updated this week
- Tutorial for setting up a Spark cluster running inside of Docker containers located on different machines ☆126 Updated 2 years ago
- A curated list of open source tools used in analytics platforms and the data engineering ecosystem ☆181 Updated 2 months ago