A write-audit-publish implementation on a data lake without the JVM
☆45Aug 12, 2024Updated last year
Alternatives and similar repositories for no-jvm-wap-with-iceberg
Users that are interested in no-jvm-wap-with-iceberg are comparing it to the libraries listed below
Sorting:
- Data Engineering Projects using Mage.ai as orchestrator☆19Jan 20, 2026Updated last month
- How to evaluate the Quality of your Data with Great Expectations and Spark.☆31Mar 29, 2023Updated 2 years ago
- Datalog implementation in Scala.☆12Jun 17, 2014Updated 11 years ago
- ☆13Oct 4, 2023Updated 2 years ago
- Transporter for integrating OpenLineage with OpenMetadata☆17Sep 10, 2025Updated 6 months ago
- This example shows how to run Anychart library with the Scala programming language using Akka Http and MySQL.☆11Dec 21, 2017Updated 8 years ago
- Demo repository for running eBPF in GitHub Actions☆23Mar 27, 2025Updated 11 months ago
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.☆26Jun 9, 2025Updated 9 months ago
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev☆40May 11, 2025Updated 10 months ago
- A DataFusion-powered Serverless S3 Proxy.☆17Apr 15, 2024Updated last year
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆37Dec 24, 2022Updated 3 years ago
- ☆22Feb 5, 2024Updated 2 years ago
- code examples C++ course spring 2020☆11Jan 10, 2022Updated 4 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Mar 9, 2021Updated 5 years ago
- Simple animation for PlantUML diagrams☆19Jul 1, 2024Updated last year
- A Table format agnostic data sharing framework☆42Feb 4, 2024Updated 2 years ago
- Testing various methods of moving Arrow data between processes☆16Mar 29, 2023Updated 2 years ago
- Malloy model examples and associated datasets☆23Feb 1, 2026Updated last month
- SQL query executor on remote DuckDB instance using Apache Arrow Flight RPC through Streamlit Web interface.☆24Nov 2, 2024Updated last year
- Capture the logical plan from Spark (SQL)☆22Mar 6, 2021Updated 5 years ago
- A Minimalistic Rust Implementation of Delta Sharing Server.☆98Mar 17, 2025Updated 11 months ago
- Big Data search with Spark and Lucene☆18Dec 15, 2023Updated 2 years ago
- ☆22Jul 18, 2024Updated last year
- ☆66May 9, 2025Updated 10 months ago
- Monitoring Databricks using Prometheus, Grafana and Pyroscope☆27Jul 29, 2025Updated 7 months ago
- A compute manifest and composable tools for data, built on Ibis, DataFusion, and Arrow Flight.☆488Updated this week
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe)☆120Mar 5, 2025Updated last year
- 🏟☆29Nov 11, 2020Updated 5 years ago
- An arrow flight extension to support ticking datasets via IPC☆28Dec 19, 2025Updated 2 months ago
- This is the example code repository for Getting Started with Impala by John Russell (O'Reilly Media)☆22Aug 20, 2017Updated 8 years ago
- Personal project for setting up an open source data warehouse.☆32Jul 11, 2025Updated 7 months ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆25Mar 3, 2024Updated 2 years ago
- End to end data engineering project☆58Oct 27, 2022Updated 3 years ago
- Python package for querying iceberg data through duckdb.☆74Feb 12, 2024Updated 2 years ago
- Serverless Python with Ray☆59Oct 14, 2022Updated 3 years ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆124Mar 31, 2025Updated 11 months ago
- DuckDB Cron Expression Extension☆28Jun 23, 2024Updated last year
- ☆65Jan 20, 2026Updated last month
- A collection of tutorials on Akka☆26Mar 6, 2016Updated 10 years ago