provectus / reference-dockerfilesLinks
Reference Dockerfiles for production usage
☆24Updated 6 years ago
Alternatives and similar repositories for reference-dockerfiles
Users that are interested in reference-dockerfiles are comparing it to the libraries listed below
Sorting:
- Testing LLMs and RAG configurations at scale using an OpenAI Reflector☆11Updated last year
- This Java library has been designed to facilitate leader election within Kafka clusters providing an efficient and robust solution for di…☆30Updated 2 years ago
- ☆24Updated 3 years ago
- Data Quality Gate based on AWS☆57Updated last year
- Distributed run of dbt models using Airflow☆168Updated 2 months ago
- Data Forge — a modern data stack playground to practice flows and best practices, not just tools. Spark, Trino, Kafka, Iceberg, ClickHous…☆168Updated 3 months ago
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only.☆58Updated 2 years ago
- Data catalog for everything in your company☆50Updated 2 years ago
- Toolkit for Agile-driven data modeling and data loading using highly Normalized hybrid Model☆23Updated last year
- 📙 Awesome Data Catalogs and Observability Platforms.☆987Updated 5 months ago
- ☆18Updated 4 years ago
- First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business…☆1,374Updated 3 weeks ago
- The most popular ClickHouse plugin for Airflow. 🔝 Top-1% downloads on PyPI: https://pypi.org/project/airflow-clickhouse-plugin! Based on…☆174Updated last month
- protobuf pyspark conversion☆23Updated 2 years ago
- Avro SerDe for Apache Spark structured APIs.☆241Updated 7 months ago
- Spark style guide☆271Updated last year
- Docker Compose with Almond.sh core for Jupyter☆18Updated last year
- A simplified, lightweight ETL Framework based on Apache Spark☆589Updated 2 years ago
- Drop-in replacement for Apache Spark UI☆397Updated last week
- Adaptation postgres adapter for Greenplum☆36Updated last year
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆188Updated last month
- The Internals of Delta Lake☆187Updated 2 months ago
- ODD Specification is a universal open standard for collecting metadata.☆146Updated last year
- Airflow declarative DAGs via YAML☆133Updated 2 years ago
- Grafana dashboards and StatsD exporter config for Airflow monitoring☆292Updated last year
- Data Engineering Digest☆29Updated last year
- Learning resources for Airflow Tutorial article.☆56Updated 5 years ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa…☆808Updated 3 weeks ago
- ☆19Updated 3 years ago
- YTsaurus SPYT provides an integration with Apache Spark☆19Updated this week