provectus / reference-dockerfilesLinks
Reference Dockerfiles for production usage
☆24Updated 5 years ago
Alternatives and similar repositories for reference-dockerfiles
Users that are interested in reference-dockerfiles are comparing it to the libraries listed below
Sorting:
- This Java library has been designed to facilitate leader election within Kafka clusters providing an efficient and robust solution for di…☆25Updated 2 years ago
- Testing LLMs and RAG configurations at scale using an OpenAI Reflector☆11Updated 6 months ago
- Data Quality Gate based on AWS☆56Updated last year
- ☆24Updated 3 years ago
- Distributed run of dbt models using Airflow☆165Updated last month
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆58Updated last year
- First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business…☆1,329Updated 4 months ago
- Toolkit for Agile-driven data modeling and data loading using highly Normalized hybrid Model☆22Updated 6 months ago
- The most popular ClickHouse plugin for Airflow. 🔝 Top-1% downloads on PyPI: https://pypi.org/project/airflow-clickhouse-plugin! Based on…☆156Updated 3 weeks ago
- Spark in Kubernetes☆39Updated last year
- Spark on Kubernetes infrastructure Helm charts repo☆203Updated 2 years ago
- 📙 Awesome Data Catalogs and Observability Platforms.☆872Updated 3 months ago
- Spark style guide☆259Updated 9 months ago
- Adaptation postgres adapter for Greenplum☆36Updated last year
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆344Updated last year
- Avro SerDe for Apache Spark structured APIs.☆235Updated last month
- ODD Specification is a universal open standard for collecting metadata.☆142Updated 8 months ago
- Orchestrate your Databricks notebooks in Airflow and execute them as Databricks Workflows☆23Updated 11 months ago
- Data catalog for everything in your company☆51Updated 2 years ago
- Drop-in replacement for Apache Spark UI☆274Updated 2 weeks ago
- Module for pipelines concept in PySpark☆16Updated last year
- ☆266Updated 8 months ago
- Learning resources for Airflow Tutorial article.☆55Updated 4 years ago
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes.☆266Updated 3 months ago
- ☆18Updated 3 years ago
- Data Platform demo☆13Updated 5 months ago
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks☆436Updated this week
- A Python Library to support running data quality rules while the spark job is running⚡☆188Updated this week
- Airflow declarative DAGs via YAML☆132Updated last year
- PySpark test helper methods with beautiful error messages☆704Updated last week