Data quality control tool built on spark and deequ
☆25May 9, 2026Updated last week
Alternatives and similar repositories for data-flare
Users that are interested in data-flare are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- End-to-end Machine Learning Pipeline demo using Delta Lake, MLflow and AzureML in Azure Databricks☆18Nov 9, 2019Updated 6 years ago
- Some Avro operations in Scala☆10May 6, 2026Updated 2 weeks ago
- A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.☆13Oct 27, 2021Updated 4 years ago
- Deriving Spark DataFrame schemas from case classes☆44Jun 24, 2024Updated last year
- 大规模社交数据可视化分析工具☆19Sep 18, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Cloud based Data Platform based on Apache Spark☆28May 6, 2026Updated 2 weeks ago
- Flink jobs collection☆17Oct 13, 2020Updated 5 years ago
- Some random how-to examples relating to Databricks.☆15Nov 3, 2021Updated 4 years ago
- ☆14Feb 10, 2026Updated 3 months ago
- ☆16Mar 18, 2026Updated 2 months ago
- [DEPRECATED] ETH 2.0 SSZ - optimized Go implementation☆13Jun 21, 2020Updated 5 years ago
- Minimal Viable Data Sync Implementation☆13Aug 29, 2023Updated 2 years ago
- OpenTelemetry agent for Scala applications☆72Updated this week
- An example of building kubernetes operator (Flink) using Abstract operator's framework☆26Jul 12, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Drop in replacement for golang/crypto/ed25519 with additional functionality☆15Feb 28, 2023Updated 3 years ago
- ☆19Jul 25, 2023Updated 2 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Jan 22, 2024Updated 2 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆63Sep 6, 2024Updated last year
- Proxy for testing network disconnects and jitter/throttling☆18Apr 13, 2026Updated last month
- MLOps Lab Example using PyTorch to predict Yelp Reviews☆21Mar 20, 2021Updated 5 years ago
- Command line tool for converting images to ASCII art☆20May 1, 2026Updated 2 weeks ago
- ZkMarek is an educational project created by ethmarek, as an exercise to learn cryptography, with focus on understanding Plonk.☆15Jul 2, 2025Updated 10 months ago
- Tool to wait until a database is up and responding to a query☆17Apr 24, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- DataQuality for BigData☆149Dec 15, 2023Updated 2 years ago
- Avro Schema Registry (mostly) compatible with salsify/avro-schema-registry☆20Apr 29, 2025Updated last year
- My Study guide used to pass the CRT020 Spark Certification exam☆34Jan 6, 2020Updated 6 years ago
- A testlab built with Nomad and Consul to analyze the behavior of p2p networks at scale☆22Jul 26, 2019Updated 6 years ago
- Push "button deploy" literally☆18Feb 15, 2016Updated 10 years ago
- Procedural macro for automatically implementing metrics description and initialization.☆24Apr 14, 2026Updated last month
- CDK Node core repo☆23Feb 5, 2026Updated 3 months ago
- ☆11May 26, 2021Updated 4 years ago
- I'll munch some data here☆12Jun 18, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Apache Amaterasu☆57Oct 18, 2019Updated 6 years ago
- Sample applications using Dozer☆16Feb 3, 2024Updated 2 years ago
- basic on the project☆18Apr 12, 2019Updated 7 years ago
- Simple UI cli LLaMA Model Finetuning☆10Mar 23, 2023Updated 3 years ago
- ansible with kubernetes☆10Feb 14, 2023Updated 3 years ago
- A convenient JavaScript interface to the Melon protocol Ethereum smart contracts.☆17Dec 12, 2020Updated 5 years ago
- A Modern and configurable CLI for managing kafka connect clusters.☆13Dec 3, 2023Updated 2 years ago