bartosz25/acid-file-formats

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bartosz25/acid-file-formats)

bartosz25 / acid-file-formats

Code for Apache Hudi, Apache Iceberg and Delta Lake analysis

☆10

Alternatives and similar repositories for acid-file-formats

Users that are interested in acid-file-formats are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

newfront / hitchhikers_guide_to_deltalake_streaming
View on GitHub
Don't Panic. This guide will help you when it feels like the end of the world.
☆31Feb 7, 2026Updated 5 months ago
bartosz25 / spark-docker
View on GitHub
Repository containing Docker images for Spark master and slave
☆15Nov 3, 2019Updated 6 years ago
bartosz25 / spark-playground
View on GitHub
Code snippets used in demos recorded for the blog.
☆42Apr 30, 2026Updated 2 months ago
wricardo / grpcurl-mcp
View on GitHub
Model Context Protocol (MCP) server to interact with gRPC services using the grpcurl tool
☆17Mar 5, 2025Updated last year
eadgbear / spark-wasm-udf
View on GitHub
Using WASM to write UDFs in Apache Spark
☆12Jun 3, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
arrikto / learn-kubeflow
View on GitHub
Learn Kubeflow with Arrikto
☆15Jan 4, 2022Updated 4 years ago
huydx / fulltext_engine
View on GitHub
simple inverted index full text search engine written in python
☆13Oct 3, 2013Updated 12 years ago
bartosz25 / spark-scala-playground
View on GitHub
Sample processing code using Spark 2.1+ and Scala
☆51Jun 28, 2020Updated 6 years ago
quiltdata / examples
View on GitHub
☆12Oct 24, 2025Updated 8 months ago
newfront / spark-intro-to-ml
View on GitHub
A Gentle introduction to Machine Learning with Apache Spark
☆11Mar 2, 2026Updated 4 months ago
laplab / rhino
View on GitHub
An experimental edge key-value database built on top of FoundationDB.
☆11Jan 9, 2025Updated last year
bartosz25 / data-ai-summit-2024
View on GitHub
Visits sessionization pipeline used for the talk
☆13May 28, 2024Updated 2 years ago
ognis1205 / mcp-server-unitycatalog
View on GitHub
Unity Catalog AI Model Context Protocol Server
☆15Mar 28, 2025Updated last year
huyhoang17 / framler
View on GitHub
[DEPRECATED] AutoCrawler - automate extracting main information from website
☆16Jun 10, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
adamw / fp-stack-2020-pres
View on GitHub
☆13Dec 12, 2020Updated 5 years ago
projectnessie / nessie-demos
View on GitHub
Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.
☆32Updated this week
pH7Software / pH7-Documentation
View on GitHub
📓 The Documentation website for the "pH7 Social Dating Builder" Software.
☆10May 13, 2023Updated 3 years ago
bufbuild / registry-proto
View on GitHub
BSR's new public API. Currently in development.
☆23Updated this week
phatak-dev / Statistical-Data-Exploration-Using-Spark-2.0
View on GitHub
Data Exploration Using Spark 2.0
☆14Apr 17, 2018Updated 8 years ago
the-pavels / train-station
View on GitHub
Demo application built on top of Apache Pulsar
☆18Feb 8, 2026Updated 5 months ago
odnoklassniki / spark-to-clickhouse-sink
View on GitHub
☆18Oct 11, 2021Updated 4 years ago
godatadriven / dbt-data-ai-summit
View on GitHub
Code that was used as an example during the Data+AI Summit 2020
☆15Mar 8, 2021Updated 5 years ago
jentrata / jentrata-msh
View on GitHub
Jentrata - Message Handler Service
☆18Sep 8, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
nevakrien / Turing-compiler
View on GitHub
an optimizing compiler to a binary turing machine
☆13Dec 16, 2024Updated last year
dhiraa / blockchain-streaming
View on GitHub
Structured Streaming using Apache Spark on Binance Blockchain Stream
☆16May 2, 2018Updated 8 years ago
rockthejvm / spark-performance-tuning
View on GitHub
The official repository for the Rock the JVM Spark Optimization 2 course
☆44Jun 20, 2026Updated 3 weeks ago
rockthejvm / udemy-akka-persistence-starter
View on GitHub
The official Rock the JVM Akka Persistence Starter project
☆11Apr 4, 2019Updated 7 years ago
japila-books / delta-lake-internals
View on GitHub
The Internals of Delta Lake
☆186Jun 18, 2026Updated 3 weeks ago
accelerant-dev / implement-rougedb
View on GitHub
Code for the course Implement RougeDB: A Redis clone from outer space written in Rust
☆11Jan 15, 2025Updated last year
Radico / trino-plugins
View on GitHub
Simplified custom plugins for Trino
☆16Jul 29, 2024Updated last year
christianroman / df-gtfs
View on GitHub
Script para importar dataset de "df_gtfs" a PostgreSQL
☆13Jun 24, 2013Updated 13 years ago
rockthejvm / udemy-akka-http
View on GitHub
For Udemy students: the official repository for the Rock the JVM Akka HTTP with Scala course
☆14Apr 27, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pH7Software / pH7Builder
View on GitHub
🚀A totally free and open source dating webapp software built with PHP (with pH7Framework) 📱
☆12Jun 11, 2022Updated 4 years ago
radumarias / syncoxiders
View on GitHub
Cloud file and email Sync, file Sharing, inter-cloud Encryption and Backup solution written in Rust and modern technologies
☆14May 15, 2026Updated last month
aimdb-dev / aimdb
View on GitHub
One API from microcontroller to browser. Define data contracts once, enforce everywhere.
☆96Updated this week
tubean / MSA-springboot-eureka
View on GitHub
Microservice with SpringBoot, Eureka server and Zuul
☆15Dec 20, 2018Updated 7 years ago
mshtelma / spark-structured-streaming-jdbc-sink
View on GitHub
Spark Structured Streaming JDBC Sink
☆16Apr 26, 2021Updated 5 years ago
rogeriomm / labtools-k8s
View on GitHub
Complete data engineering pipeline running on Minikube Kubernetes, Argo CD, Spark, Trino, S3, Delta lake, Postgres+ Debezium CDC, MySQL,…
☆29May 19, 2025Updated last year
johnlemon93 / blog-page-ssr
View on GitHub
static blog engine - blog page
☆10Dec 24, 2020Updated 5 years ago