BauplanLabs / examples
reference implementations and use cases done with bauplan
☆47Updated 2 weeks ago
Alternatives and similar repositories for examples:
Users that are interested in examples are comparing it to the libraries listed below
- A playground for running duckdb as a stateless query engine over a data lake☆197Updated last year
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 8 months ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- Playground for using large language models into the Modern Data Stack for entity matching☆107Updated 2 years ago
- The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆☆178Updated this week
- Supporting materials/code examples for my course in data engineering for machine learning.☆38Updated 2 years ago
- An experimental Athena extension for DuckDB 🐤☆54Updated 3 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆108Updated 4 months ago
- ☆32Updated last year
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe)☆70Updated last month
- ☆26Updated 2 years ago
- Deploy production-grade Metaflow cloud infrastructure on AWS☆67Updated this week
- Repo for orienting dbt users to the Dagster asset framework☆54Updated 2 years ago
- CLI to create an ER Diagram from DuckDB database files☆120Updated last month
- Python package for querying iceberg data through duckdb.☆64Updated last year
- Dask integration for Snowflake☆30Updated 5 months ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- A python library bakeoff for medium sized datasets☆24Updated last year
- Run dbt serverless in the Cloud (AWS)☆42Updated 5 years ago
- A DuckDB-powered command line interface for Snowflake security, governance, operations, and cost optimization.☆40Updated 8 months ago
- An example of how to run DuckDB on AWS Lambda & API Gateway.☆145Updated 3 weeks ago
- Joining the modern data stack with the modern ML stack☆195Updated last year
- A Rust based data/CSV/Parquet file generator☆51Updated last month
- Assessing whether data from database complies with reference information.☆42Updated this week
- Serverless multi-protocol + multi-destination event collection system.☆202Updated 5 months ago
- FlockMTL: DuckDB extension to seamlessly combine analytics and semantic analysis using language models (LMs)☆194Updated this week
- A series of Terraform based recipes to provision popular MLOps stacks on the cloud.☆255Updated 6 months ago
- ☆51Updated 2 weeks ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- Snowflake bring-your-own-cloud option. Run Snowflake as a microservice on your own compute☆30Updated this week