linkedin/iceberg

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/linkedin/iceberg)

linkedin / iceberg

A home for LinkedIn's changes to Apache Iceberg

☆65

Alternatives and similar repositories for iceberg

Users that are interested in iceberg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

linkedin / linkedin-calcite
View on GitHub
LinkedIn's version of Apache Calcite
☆23Jul 15, 2025Updated last year
linkedin / transport
View on GitHub
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…
☆306Jun 29, 2026Updated 3 weeks ago
linkedin / coral
View on GitHub
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆907Updated this week
absognety / atomic-scala
View on GitHub
Atomic Scala Book Solutions - for Beginners and first time Functional Programmers
☆12Mar 10, 2020Updated 6 years ago
SETL-Framework / setl
View on GitHub
A simple Spark-powered ETL framework that just works 🍺
☆186Oct 2, 2025Updated 9 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
apache / kyuubi-website
View on GitHub
Apache Kyuubi Site
☆13Jun 12, 2026Updated last month
aravinthsci / Spark_Delta_Lake
View on GitHub
Delta Lake Examples
☆11Apr 24, 2020Updated 6 years ago
coocood / stadis
View on GitHub
Stand-alone Distributed System, test distributed system on localhost.
☆30Apr 23, 2014Updated 12 years ago
kelindar / timeline
View on GitHub
Scheduler of events for near real-time systems
☆31Aug 21, 2025Updated 11 months ago
uber / RemoteShuffleService
View on GitHub
Remote shuffle service for Apache Spark to store shuffle data on remote servers.
☆335Sep 29, 2023Updated 2 years ago
YotpoLtd / metorikku
View on GitHub
A simplified, lightweight ETL Framework based on Apache Spark
☆588Jan 24, 2024Updated 2 years ago
ABigdataer / UserPortrait
View on GitHub
基于Flink的用户画像系统
☆10Dec 10, 2022Updated 3 years ago
txn2 / datalab
View on GitHub
Custom JupyterLab container for local-workstations and in-cluster Kubernetes Data Science, Machine Learning and IoT.
☆12Aug 22, 2019Updated 6 years ago
AbsaOSS / spline-spark-agent
View on GitHub
Spline agent for Apache Spark
☆207Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
qubole / spark-acid
View on GitHub
ACID Data Source for Apache Spark based on Hive ACID
☆97Jul 7, 2021Updated 5 years ago
apache / iceberg
View on GitHub
Apache Iceberg
☆9,067Updated this week
Renien / ETL-Starter-Kit
View on GitHub
Extract, Transform, Load (ETL) refers to a process in database usage and especially in data warehousing. This repository contains a s…
☆21Mar 20, 2017Updated 9 years ago
satoshihirose / how-to-use-avro-tools
View on GitHub
this repogitory describe how to use avro-tools
☆12Feb 21, 2018Updated 8 years ago
mozhijun / easybms-ssm
View on GitHub
基于SSM的后台管理系统-EasyBMS
☆13Dec 16, 2022Updated 3 years ago
dermatologist / omopfhirmap
View on GitHub
OMOP <-> FHIR mapper
☆11Mar 6, 2023Updated 3 years ago
jackwaudby / awesome-consistency
View on GitHub
Awesome list of consistency models
☆19Dec 22, 2021Updated 4 years ago
atopia / sandcrust
View on GitHub
Sandboxing C in Rust
☆19Jun 16, 2025Updated last year
linkedin / openhouse
View on GitHub
Open Control Plane for Tables in Data Lakehouse
☆392Jul 14, 2026Updated last week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
matrixorigin / matrixkv
View on GitHub
This is a distributed kv project to demonstrate how to use matrixcube
☆17Sep 8, 2022Updated 3 years ago
ahupowerdns / ahutils
View on GitHub
attempt to create a library of code snippets I use a lot
☆18Oct 3, 2014Updated 11 years ago
eternalstone / SensitiveBye
View on GitHub
SensitiveBye是一款专注于解决数据脱敏的Java和SpringBoot工具包, 能帮助您快速解决项目中的脱敏需求，支持对象字段，接口字段，数据库字段脱敏，json序列化脱敏，日志打印脱敏、敏感词条脱敏、Spring配置文件脱敏等功能
☆12Jun 5, 2025Updated last year
apache / hudi
View on GitHub
Upserts, Deletes And Incremental Processing on Big Data.
☆6,193Updated this week
victorcouste / trino-datastudio-connector
View on GitHub
Trino Community Connector for Google Data Studio
☆11Jan 5, 2022Updated 4 years ago
magthe / sandi
View on GitHub
Data encoding library for Haskell.
☆12Aug 4, 2023Updated 2 years ago
NACHC-CAD / fhir-to-omop
View on GitHub
☆12May 30, 2025Updated last year
miku / filterline
View on GitHub
Command line tool to filter file by line number.
☆12Jul 9, 2025Updated last year
albfan / mvnexec
View on GitHub
bash script to find and execute java classes with main methods
☆20Oct 24, 2025Updated 8 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
king / bravo
View on GitHub
Utilities for processing Flink checkpoints/savepoints
☆75Dec 11, 2019Updated 6 years ago
xo / tblfmt
View on GitHub
streaming, buffered table encoder for result sets (ie from a database)
☆22Jun 13, 2026Updated last month
jeff-davis / postgres-extension.rs
View on GitHub
Write PostgreSQL extensions in pure rust
☆36Aug 9, 2023Updated 2 years ago
apache / celeborn
View on GitHub
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
☆1,056Updated this week
databricks-industry-solutions / interop
View on GitHub
From FHIR ingestion to patient outcomes analysis
☆15Dec 2, 2024Updated last year
pgulutzan / descriptive-sql-style-guide
View on GitHub
Descriptive SQL style guide
☆13Jun 16, 2022Updated 4 years ago
snowplow-archive / iglu-example-schema-registry
View on GitHub
Example static schema registry for Iglu
☆15Jun 21, 2023Updated 3 years ago