provectus / reference-dockerfiles
Reference Dockerfiles for production usage
☆24 Updated 6 years ago
Alternatives and similar repositories for reference-dockerfiles
Users who are interested in reference-dockerfiles are comparing it to the repositories listed below
- This Java library has been designed to facilitate leader election within Kafka clusters providing an efficient and robust solution for di… ☆29 Updated 2 years ago
- Testing LLMs and RAG configurations at scale using an OpenAI Reflector ☆11 Updated 11 months ago
- Data Quality Gate based on AWS ☆57 Updated last year
- ☆24 Updated 3 years ago
- Distributed run of dbt models using Airflow ☆168 Updated last month
- Data Forge — a modern data stack playground to practice flows and best practices, not just tools. Spark, Trino, Kafka, Iceberg, ClickHous… ☆161 Updated 2 months ago
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only. ☆58 Updated last year
- First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business… ☆1,370 Updated last week
- 📙 Awesome Data Catalogs and Observability Platforms. ☆955 Updated 4 months ago
- The most popular ClickHouse plugin for Airflow. 🔝 Top-1% downloads on PyPI: https://pypi.org/project/airflow-clickhouse-plugin! Based on… ☆171 Updated last week
- Grafana dashboards and StatsD exporter config for Airflow monitoring ☆289 Updated last year
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC) ☆64 Updated 2 years ago
- Toolkit for Agile-driven data modeling and data loading using highly Normalized hybrid Model ☆23 Updated last year
- Drop-in replacement for Apache Spark UI ☆373 Updated 3 weeks ago
- ITSumma Spark Greenplum Connector ☆42 Updated last year
- Spark Structured Streaming data pipeline that processes movie ratings data in real-time. ☆13 Updated last week
- Repository of helm charts for deploying DataHub on a Kubernetes cluster ☆197 Updated 2 weeks ago
- Data catalog for everything in your company ☆50 Updated 2 years ago
- Python API for Deequ ☆806 Updated 8 months ago
- Avro SerDe for Apache Spark structured APIs. ☆238 Updated 6 months ago
- Learning resources for Airflow Tutorial article. ☆57 Updated 5 years ago
- A Generalized Metadata Search & Discovery Tool ☆30 Updated this week
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. ☆503 Updated last month
- PySpark test helper methods with beautiful error messages ☆740 Updated last week
- Spark in Kubernetes ☆39 Updated last year
- Course materials for Airflow 101 ☆15 Updated 5 years ago
- Spark style guide ☆271 Updated last year
- ☆18 Updated 4 years ago
- ☆12 Updated 4 years ago
- trino monitoring with JMX metrics through Prometheus and Grafana ☆16 Updated last year