Reference implementations and use cases built with bauplan
☆62 · Mar 30, 2026 · Updated last month
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below.
- A playground for running duckdb as a stateless query engine over a data lake ☆221 · Jan 10, 2024 · Updated 2 years ago
- Materials for my 2021 NYU class on NLP and ML Systems (Master of Engineering). ☆97 · Dec 19, 2022 · Updated 3 years ago
- ☆12 · Oct 25, 2023 · Updated 2 years ago
- Official Repository for EvalRS @ CIKM 2022: a Rounded Evaluation of Recommender Systems ☆71 · Mar 21, 2024 · Updated 2 years ago
- Playground for integrating large language models into the Modern Data Stack for entity matching ☆108 · Apr 1, 2023 · Updated 3 years ago
- ☆23 · Jun 28, 2022 · Updated 3 years ago
- A platform to manage the data product life cycle ☆22 · Mar 25, 2026 · Updated 2 months ago
- a tool for defining repeatable processes in code ☆13 · Oct 29, 2019 · Updated 6 years ago
- A PaaS End-to-End ML Setup with Metaflow, Serverless and SageMaker. ☆37 · Feb 10, 2021 · Updated 5 years ago
- Joining the modern data stack with the modern ML stack ☆202 · May 16, 2023 · Updated 3 years ago
- Simple examples showing how to use ADBC with various databases, query engines, and data platforms ☆46 · Updated this week
- Recommendations at "Reasonable Scale": joining dataOps with recSys through dbt, Merlin and Metaflow ☆241 · Apr 7, 2023 · Updated 3 years ago
- Set up a Cost-Effective Modern Data Stack for a Charity ☆19 · Mar 26, 2025 · Updated last year
- Example code to create high-quality knowledge graphs using entity resolution with Kuzu and Senzing ☆24 · Sep 17, 2025 · Updated 8 months ago
- A write-audit-publish implementation on a data lake without the JVM ☆45 · Aug 12, 2024 · Updated last year
- Artifacts of the EKGF Data Product Workgroup (DPROD) ☆36 · May 7, 2026 · Updated 2 weeks ago
- Demo repository to lambda-fy your dbt runs ☆11 · Sep 7, 2023 · Updated 2 years ago
- ☆10 · Nov 1, 2022 · Updated 3 years ago
- ☆35 · Jul 23, 2023 · Updated 2 years ago
- A dbt package to run natural language queries ☆10 · Jan 13, 2023 · Updated 3 years ago
- adapt data to and from every format ☆28 · Apr 27, 2026 · Updated 3 weeks ago
- ☆22 · Mar 31, 2022 · Updated 4 years ago
- ☆20 · Jan 3, 2025 · Updated last year
- Unleash the performance potential of your Parquet files. ☆53 · Feb 24, 2026 · Updated 3 months ago
- The Data Product Descriptor Specification (DPDS) Repository ☆83 · Jan 14, 2025 · Updated last year
- Testing various methods of moving Arrow data between processes ☆17 · Mar 29, 2023 · Updated 3 years ago
- Pytest plugin type-checking tests, fixtures, and/or your codebase with @beartype. ☆24 · Apr 15, 2026 · Updated last month
- DuckDB CronJob Extension ☆49 · Mar 29, 2026 · Updated last month
- An end-to-end implementation of intent prediction with Metaflow and other cool tools ☆874 · Jun 16, 2023 · Updated 2 years ago
- Python+node wrapper to read/send message from/to Anki Overdrive bluetooth vehicles. ☆18 · Aug 9, 2022 · Updated 3 years ago
- A Domain-Specific Language, Jailbreak Attack Synthesizer and Dynamic LLM Redteaming Toolkit ☆27 · Dec 5, 2024 · Updated last year
- A Write-Ahead Log (WAL) design built exclusively on object storage primitives. ☆53 · Oct 4, 2025 · Updated 7 months ago
- Managing Data as a Product, published by Packt ☆21 · Nov 30, 2024 · Updated last year
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021. ☆21 · Apr 8, 2025 · Updated last year
- e-Rum2020 CovidR Contest ☆19 · Feb 18, 2023 · Updated 3 years ago
- C inference engine for running GLiClass (Generalist and Lightweight Classification) models ☆17 · May 21, 2025 · Updated last year
- A lightweight and flexible analysis pipeline ☆12 · May 14, 2026 · Updated last week
- 🏟 ☆28 · Nov 11, 2020 · Updated 5 years ago
- Shared code for training sentence embeddings with Flax / JAX ☆28 · Jul 15, 2021 · Updated 4 years ago