timgent/data-flare

Data quality control tool built on spark and deequ

☆25

Alternatives and similar repositories for data-flare

Users who are interested in data-flare are comparing it to the libraries listed below.

simonw / datasette-llm-embed
Datasette plugin adding a llm_embed(model_id, text) SQL function
☆18 · Mar 17, 2024 · Updated 2 years ago
irajhedayati / savro
Some Avro operations in Scala
☆10 · Jun 29, 2026 · Updated 3 weeks ago
eikek / yamusca
Yet another mustache impl for scala
☆22 · Oct 15, 2025 · Updated 9 months ago
decanus / bureka
Pastry DHT implementation with a standalone libp2p compatible node
☆12 · Jun 22, 2020 · Updated 6 years ago
colbyford / sparkitecture
A collection of “cookbook-style” scripts for simplifying data engineering and machine learning in Apache Spark.
☆13 · Oct 27, 2021 · Updated 4 years ago
Azure / Azure-AI-Camp
Azure AI Camp - 2 day workshop on Databricks and Azure ML
☆20 · Jul 23, 2023 · Updated 2 years ago
okkam-it / flink-examples
Flink jobs collection
☆17 · Oct 13, 2020 · Updated 5 years ago
homeaway / datapull
Cloud based Data Platform based on Apache Spark
☆28 · Jun 30, 2026 · Updated 3 weeks ago
justinbreese / databricks-gems
Some random how-to examples relating to Databricks.
☆15 · Nov 3, 2021 · Updated 4 years ago
lightbend / flink-k8s-operator
An example of building kubernetes operator (Flink) using Abstract operator's framework
☆26 · Jul 12, 2019 · Updated 7 years ago
Jimaras08 / mlops-lab-example-yelp
MLOps Lab Example using PyTorch to predict Yelp Reviews
☆21 · Mar 20, 2021 · Updated 5 years ago
datamindedbe / lighthouse
Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…
☆64 · Sep 6, 2024 · Updated last year
Apkawa / pytest-image-diff
pytest helps for compare images and regression
☆13 · Dec 31, 2024 · Updated last year
nlopes / avro-schema-registry
Avro Schema Registry (mostly) compatible with salsify/avro-schema-registry
☆20 · Apr 29, 2025 · Updated last year
agile-lab-dev / DataQuality
DataQuality for BigData
☆149 · Dec 15, 2023 · Updated 2 years ago
libp2p / testlab
A testlab built with Nomad and Consul to analyze the behavior of p2p networks at scale
☆22 · Jul 26, 2019 · Updated 6 years ago
oktadev / okta-java-ee-rest-api-example
Java EE REST API + Security with JWT and OIDC
☆13 · Sep 24, 2018 · Updated 7 years ago
absognety / atomic-scala
Atomic Scala Book Solutions - for Beginners and first time Functional Programmers
☆12 · Mar 10, 2020 · Updated 6 years ago
mdrakiburrahman / databricks-certification
My Study guide used to pass the CRT020 Spark Certification exam
☆34 · Jan 6, 2020 · Updated 6 years ago
simonwuelker / Stormlicht
The Stormlicht browser engine.
☆15 · Aug 13, 2024 · Updated last year
solid-contrib / practitioners
A hub for Solid developers
☆37 · Jun 22, 2026 · Updated 3 weeks ago
aws-samples / amazon-deequ-glue
Automated data quality suggestions and analysis with Deequ on AWS Glue
☆93 · Dec 29, 2022 · Updated 3 years ago
tanakh / axum-yew-shuttle-realworld-example-app
Starter kit for new RealWorld framework implementations
☆14 · Dec 16, 2022 · Updated 3 years ago
rstudio / vetiver.posit.co
Website for the vetiver 🏺 framework
☆12 · May 28, 2025 · Updated last year
lucacasonato / deno_googleapis
☆23 · Aug 7, 2023 · Updated 2 years ago
rogeriochaves / notebooks
I'll munch some data here
☆12 · Jun 18, 2021 · Updated 5 years ago
MonetDB / MonetDBLite-Java
☆11 · May 26, 2021 · Updated 5 years ago
getdozer / dozer-samples
Sample applications using Dozer
☆16 · Feb 3, 2024 · Updated 2 years ago
bert2 / build-your-own-sqlite-rust
My 🦀 solution for https://codecrafters.io/challenges/sqlite
☆15 · May 23, 2022 · Updated 4 years ago
HackerAIOfficial / simple-llama-finetuner
Simple UI cli LLaMA Model Finetuning
☆10 · Mar 23, 2023 · Updated 3 years ago
denoland / chatspace
Real-time, collaborative GPT frontend built with Deno KV
☆25 · Feb 14, 2024 · Updated 2 years ago
ericmjl / fundl
A pedagogical, functional-oriented deep learning library built on top of jax.
☆15 · Jul 19, 2021 · Updated 5 years ago
davebrace / vim-testnav
Easier navigation between production and test files in vim.
☆11 · Mar 16, 2017 · Updated 9 years ago
biglocalnews / prefect-flow-template
A template repository with all the fundamentals needed to develop and deploy a Python data-processing routine for Prefect pipelines.
☆19 · Mar 29, 2022 · Updated 4 years ago
A-Fayez / kofr
A Modern and configurable CLI for managing kafka connect clusters.
☆13 · Dec 3, 2023 · Updated 2 years ago
Jien8Huang / payflow-payments-ops-platform
Payments platform project focused on production operations: tenant-safe APIs, idempotent money flows, transactional outbox for async proc…
☆15 · Jul 13, 2026 · Updated last week
naomijub / JVM-rust-ffi
☆22 · Jun 6, 2022 · Updated 4 years ago
shashwot / ansible-kubernetes
ansible with kubernetes
☆10 · Feb 14, 2023 · Updated 3 years ago
watergis / sveltekit-watergis-template
This repository is a sveltekit template to develop GIS for water application quickly
☆22 · Updated this week