provectus / reference-dockerfilesLinks
Reference Dockerfiles for production usage
☆24Updated 5 years ago
Alternatives and similar repositories for reference-dockerfiles
Users that are interested in reference-dockerfiles are comparing it to the libraries listed below
Sorting:
- Testing LLMs and RAG configurations at scale using an OpenAI Reflector☆11Updated 11 months ago
- This Java library has been designed to facilitate leader election within Kafka clusters providing an efficient and robust solution for di…☆29Updated 2 years ago
- Data Quality Gate based on AWS☆57Updated last year
- ☆24Updated 3 years ago
- Distributed run of dbt models using Airflow☆168Updated last month
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆58Updated last year
- Spark style guide☆266Updated last year
- Toolkit for Agile-driven data modeling and data loading using highly Normalized hybrid Model☆23Updated 11 months ago
- The most popular ClickHouse plugin for Airflow. 🔝 Top-1% downloads on PyPI: https://pypi.org/project/airflow-clickhouse-plugin! Based on…☆170Updated last week
- Data Forge — a modern data stack playground to practice flows and best practices, not just tools. Spark, Trino, Kafka, Iceberg, ClickHous…☆162Updated 2 months ago
- Grafana dashboards and StatsD exporter config for Airflow monitoring☆288Updated last year
- First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business…☆1,368Updated 9 months ago
- Drop-in replacement for Apache Spark UI☆364Updated 2 weeks ago
- 📙 Awesome Data Catalogs and Observability Platforms.☆952Updated 4 months ago
- ☆18Updated 4 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆586Updated last year
- Airflow declarative DAGs via YAML☆133Updated 2 years ago
- PySpark test helper methods with beautiful error messages☆735Updated last week
- ☆269Updated last year
- Avro SerDe for Apache Spark structured APIs.☆238Updated 6 months ago
- Adaptation postgres adapter for Greenplum☆36Updated last year
- Apache Airflow integration for dbt☆412Updated last year
- Data catalog for everything in your company☆50Updated 2 years ago
- Apache Spark testing helpers (dependency free & works with Scalatest, uTest, and MUnit)☆451Updated 4 months ago
- Data Engineering misc☆14Updated 4 years ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆801Updated last month
- Python API for Deequ☆806Updated 8 months ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆346Updated last year
- The Clickhouse plugin for dbt (data build tool)☆317Updated this week
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆185Updated last week