A write-audit-publish implementation on a data lake without the JVM
☆45 · Aug 12, 2024 · Updated last year
Alternatives and similar repositories for no-jvm-wap-with-iceberg
Users that are interested in no-jvm-wap-with-iceberg are comparing it to the libraries listed below.
- A playground for running duckdb as a stateless query engine over a data lake ☆221 · Jan 10, 2024 · Updated 2 years ago
- Testing various methods of moving Arrow data between processes ☆17 · Mar 29, 2023 · Updated 3 years ago
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them ☆26 · Mar 24, 2026 · Updated 2 months ago
- Python library for an Iceberg lakehouse on your local machine ☆14 · Jan 8, 2026 · Updated 5 months ago
- How to evaluate the Quality of your Data with Great Expectations and Spark. ☆32 · Mar 29, 2023 · Updated 3 years ago
- SQL query executor on remote DuckDB instance using Apache Arrow Flight RPC through Streamlit Web interface. ☆25 · Nov 2, 2024 · Updated last year
- Example gaming leaderboard application covering streaming ingestion, CDC enrichment, processing and visualisation including demo of advan… ☆21 · Nov 18, 2025 · Updated 7 months ago
- Demo repository to lambda-fy your dbt runs ☆11 · Sep 7, 2023 · Updated 2 years ago
- A DataFusion-powered Serverless S3 Proxy. ☆17 · Apr 15, 2024 · Updated 2 years ago
- Open-source agentic schema CLI. Optimised for claude code, gemini, codex and co-pilot. Skills included. ☆48 · May 14, 2026 · Updated last month
- Serve a 1x1 GIF pixel from an AWS lambda-powered endpoint ☆13 · Sep 7, 2017 · Updated 8 years ago
- A "modern" Strava data pipeline fueled by dlt, duckdb, dbt, and evidence.dev ☆40 · May 11, 2025 · Updated last year
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB ☆37 · Dec 24, 2022 · Updated 3 years ago
- A dbt package to run natural language queries ☆10 · Jan 13, 2023 · Updated 3 years ago
- Transporter for integrating OpenLineage with OpenMetadata ☆18 · Sep 10, 2025 · Updated 9 months ago
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe) ☆123 · Mar 5, 2025 · Updated last year
- ☆193 · May 21, 2025 · Updated last year
- Malloy model examples and associated datasets ☆23 · Jun 11, 2026 · Updated last week
- Helm chart for Lakekeeper - a Rust Native Iceberg REST Catalog ☆24 · Updated this week
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models. ☆29 · Dec 7, 2021 · Updated 4 years ago
- Anki Overdrive API for Python ☆12 · Oct 21, 2017 · Updated 8 years ago
- ☆22 · Mar 31, 2022 · Updated 4 years ago
- Unleash the performance potential of your Parquet files. ☆53 · Feb 24, 2026 · Updated 3 months ago
- End to end data engineering project ☆59 · Oct 27, 2022 · Updated 3 years ago
- GAPandas4 is a Python package for querying the Google Analytics Data API for GA4 and displaying the results in a Pandas dataframe. ☆34 · Jul 6, 2022 · Updated 3 years ago
- Building a poor man's data lake: Exploring the Power of Polars and Delta Lake ☆11 · Dec 6, 2025 · Updated 6 months ago
- ☆12 · Oct 25, 2023 · Updated 2 years ago
- Apache Spark Connect Client for Rust ☆116 · Jun 10, 2025 · Updated last year
- ☆13 · Oct 4, 2023 · Updated 2 years ago
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆96 · Feb 22, 2025 · Updated last year
- A work-in-progress book on Dask ☆12 · Jul 15, 2023 · Updated 2 years ago
- Datalog implementation in Scala. ☆12 · Jun 17, 2014 · Updated 12 years ago
- A flake8 plugin that detects usage of withColumn in a loop or inside reduce ☆28 · Jun 20, 2025 · Updated 11 months ago
- Create Django REST APIs the right way, no magic intended ☆11 · Dec 8, 2022 · Updated 3 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients ☆37 · Mar 9, 2021 · Updated 5 years ago
- Scraping and querying documents for LLMs ☆24 · Oct 6, 2025 · Updated 8 months ago
- Personal project for setting up an open source data warehouse. ☆32 · Jul 11, 2025 · Updated 11 months ago
- Executable memory system for tabular data work ☆519 · Jun 12, 2026 · Updated last week
- Reference implementations and use cases done with bauplan ☆62 · Mar 30, 2026 · Updated 2 months ago