BauplanLabs / examples
reference implementations and use cases done with bauplan
☆60 Updated last week
Alternatives and similar repositories for examples
Users who are interested in examples are comparing it to the repositories listed below
- A playground for running duckdb as a stateless query engine over a data lake ☆211 Updated last year
- Python package for querying iceberg data through duckdb. ☆70 Updated last year
- Pushdown compute from Snowflake to DuckDB running on your infrastructure ☆192 Updated this week
- Build reliable AI and agentic applications with DataFrames ☆345 Updated this week
- 🏃♀️ Minimalist SQL orchestrator ☆263 Updated this week
- An example of how to run DuckDB on AWS Lambda & API Gateway. ☆205 Updated 4 months ago
- multi-engine batch transformation framework ☆448 Updated this week
- The smallest DuckDB SQL orchestrator on Earth. ☆325 Updated 5 months ago
- A write-audit-publish implementation on a data lake without the JVM ☆46 Updated last year
- ☆158 Updated 4 months ago
- IbisML is a library for building scalable ML pipelines using Ibis. ☆116 Updated 2 months ago
- Packaging DuckDB for Node.js Lambda functions. Example application: https://github.com/tobilg/serverless-duckdb ☆144 Updated this week
- Turning PySpark Into a Universal DataFrame API ☆434 Updated this week
- Possibly the fastest DataFrame-agnostic quality check library in town. ☆220 Updated this week
- CLI to create an ER Diagram from DuckDB database files ☆135 Updated 7 months ago
- Flock: multimodal querying for DuckDB ☆272 Updated 3 weeks ago
- Quickstart for any service ☆163 Updated this week
- Work with your web service, database, and streaming schemas in a single format. ☆343 Updated last month
- DuckDB for streaming data ☆632 Updated last month
- A Python framework for defining and querying BI models in your data warehouse ☆169 Updated 8 months ago
- Playground for using large language models into the Modern Data Stack for entity matching ☆108 Updated 2 years ago
- ☆265 Updated last week
- A high-performance data streaming system using DuckDB and Apache Arrow Flight. ☆89 Updated 7 months ago
- Anomstack - Painless open source anomaly detection for your metrics 📈📉🚀 ☆105 Updated last week
- A Python package for the statistical analysis of A/B tests. ☆309 Updated 2 weeks ago
- A FastMCP tool to search and retrieve Polars API documentation. ☆68 Updated 4 months ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner ☆71 Updated 3 years ago
- Supporting materials/code examples for my course in data engineering for machine learning. ☆38 Updated 2 years ago
- Sord Data Fabric: A Vue 3 frontend with a Python WebSocket server, leveraging a distributed architecture with DeltaLake and DuckDB worker… ☆18 Updated last year
- This repo contains information about DuckDB extensions found on GitHub. Refreshed daily ☆102 Updated this week