provectus / reference-dockerfiles
Reference Dockerfiles for production usage
☆24 Updated 5 years ago
Alternatives and similar repositories for reference-dockerfiles
Users who are interested in reference-dockerfiles are comparing it to the repositories listed below
- This Java library has been designed to facilitate leader election within Kafka clusters providing an efficient and robust solution for di… ☆25 Updated 2 years ago
- Testing LLMs and RAG configurations at scale using an OpenAI Reflector ☆11 Updated 5 months ago
- Data Quality Gate based on AWS ☆56 Updated 11 months ago
- ☆24 Updated 2 years ago
- Distributed run of dbt models using Airflow ☆164 Updated 3 weeks ago
- ODD Specification is a universal open standard for collecting metadata. ☆142 Updated 7 months ago
- 📙 Awesome Data Catalogs and Observability Platforms. ☆866 Updated 2 months ago
- MLOps Platform ☆273 Updated 7 months ago
- DE or DIE meetup made by data engineers for data engineers. Currently in Russian only. ☆58 Updated last year
- Web UI for the Hydrosphere.io project. ☆11 Updated 2 years ago
- Serverless proxy for Spark cluster ☆326 Updated 4 years ago
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source. ☆344 Updated last year
- Avro SerDe for Apache Spark structured APIs. ☆236 Updated 2 weeks ago
- The most popular ClickHouse plugin for Airflow. 🔝 Top-1% downloads on PyPI: https://pypi.org/project/airflow-clickhouse-plugin! Based on… ☆155 Updated last month
- Toolkit for Agile-driven data modeling and data loading using highly Normalized hybrid Model ☆21 Updated 6 months ago
- Drop-in replacement for Apache Spark UI ☆269 Updated last week
- Python API for Deequ ☆779 Updated 2 months ago
- Spark style guide ☆259 Updated 8 months ago
- This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spa… ☆764 Updated 3 weeks ago
- PySpark test helper methods with beautiful error messages ☆699 Updated 2 weeks ago
- pyspark methods to enhance developer productivity 📣 👯 🎉 ☆672 Updated 3 months ago
- Essential Spark extensions and helper methods ✨😲 ☆761 Updated 8 months ago
- The Internals of Delta Lake ☆184 Updated 5 months ago
- Spark in Kubernetes ☆39 Updated last year
- DBND is an agile pipeline framework that helps data engineering teams track and orchestrate their data processes. ☆266 Updated 3 months ago
- Qubole Sparklens tool for performance tuning Apache Spark ☆579 Updated last year
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks ☆433 Updated 4 months ago
- Docker Compose with Almond.sh core for Jupyter ☆19 Updated 9 months ago
- Simple demo using "behave" and "pyspark" libraries to test data transformations in a human-readable way ☆10 Updated 6 years ago
- Adaptation postgres adapter for Greenplum ☆36 Updated last year