reference implementations and use cases done with bauplan
☆62Mar 30, 2026Updated 3 months ago
Alternatives and similar repositories for examples
Users that are interested in examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Repository for EvalRS @ KDD 2023: a Rounded Evaluation of Recommender Systems☆30Feb 16, 2024Updated 2 years ago
- A playground for running duckdb as a stateless query engine over a data lake☆221Jan 10, 2024Updated 2 years ago
- Materials for my 2021 NYU class on NLP and ML Systems (Master of Engineering).☆96Dec 19, 2022Updated 3 years ago
- ☆12Oct 25, 2023Updated 2 years ago
- Official Repository for EvalRS @ CIKM 2022: a Rounded Evaluation of Recommender Systems☆71Mar 21, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Playground for using large language models into the Modern Data Stack for entity matching☆108Apr 1, 2023Updated 3 years ago
- ☆23Jun 28, 2022Updated 4 years ago
- A platform to manage the data product life cycle☆22Mar 25, 2026Updated 3 months ago
- a tool for defining repeatable processes in code☆13Oct 29, 2019Updated 6 years ago
- How to evaluate the Quality of your Data with Great Expectations and Spark.☆32Mar 29, 2023Updated 3 years ago
- A PaaS End-to-End ML Setup with Metaflow, Serverless and SageMaker.☆37Feb 10, 2021Updated 5 years ago
- A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR…☆42Aug 20, 2024Updated last year
- ☆21Jul 23, 2025Updated 11 months ago
- Recommendations at "Reasonable Scale": joining dataOps with recSys through dbt, Merlin and Metaflow☆239Apr 7, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- FashionCLIP is a CLIP-like model fine-tuned for the fashion domain.☆521Jan 30, 2025Updated last year
- Example code to create high-quality knowledge graphs using entity resolution with Kuzu and Senzing☆25Sep 17, 2025Updated 9 months ago
- Behavioral "black-box" testing for recommender systems☆474Aug 9, 2023Updated 2 years ago
- A write-audit-publish implementation on a data lake without the JVM☆45Aug 12, 2024Updated last year
- Artifacts of the EKGF Data Product Workgroup (DPROD)☆35Jun 17, 2026Updated 2 weeks ago
- Demo repository to lambda-fy your dbt runs☆11Sep 7, 2023Updated 2 years ago
- ☆10Nov 1, 2022Updated 3 years ago
- ☆35Jul 23, 2023Updated 2 years ago
- A dbt package to run natural language queries☆10Jan 13, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- adapt data to and from every format☆28Apr 27, 2026Updated 2 months ago
- Unleash the performance potential of your Parquet files.☆52Feb 24, 2026Updated 4 months ago
- Testing various methods of moving Arrow data between processes☆17Mar 29, 2023Updated 3 years ago
- Pytest plugin type-checking tests, fixtures, and/or your codebase with @beartype.☆25Jun 19, 2026Updated 2 weeks ago
- Analytics engineering with dbt - projects and developer environment☆22Sep 27, 2024Updated last year
- An implementation of Defeasible Logic in Python☆15Sep 2, 2018Updated 7 years ago
- An R library for working with Table Schema.☆27Apr 10, 2025Updated last year
- Open Source Data Contracts In JSON to UNIFY understanding and efforts efficiently☆16Dec 16, 2022Updated 3 years ago
- A project to define an RDF style ontology for wines and the wine-industry☆24Aug 11, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- DuckDB CronJob Extension☆50Mar 29, 2026Updated 3 months ago
- GitHub Pages documenting Open Data Mesh Platform☆13Nov 4, 2025Updated 8 months ago
- Content published on social channels☆17Apr 5, 2025Updated last year
- An end-to-end implementation of intent prediction with Metaflow and other cool tools☆876Jun 16, 2023Updated 3 years ago
- Python+node wrapper to read/send message from/to Anki Overdrive bluetooth vehicles.☆18Aug 9, 2022Updated 3 years ago
- ☆20Apr 12, 2024Updated 2 years ago
- Managing Data as a Product, published by Packt☆23Nov 30, 2024Updated last year