jordanvolz / lolpopLinks
A software engineering framework to jump start your machine learning projects
☆37Updated last year
Alternatives and similar repositories for lolpop
Users that are interested in lolpop are comparing it to the libraries listed below
Sorting:
- Dask integration for Snowflake☆30Updated 5 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆120Updated 5 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆56Updated 6 months ago
- Abstractions for feature engineering on large graphs of tabular data.☆24Updated 2 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated 2 months ago
- Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!☆139Updated 3 weeks ago
- Supporting materials/code examples for my course in data engineering for machine learning.☆39Updated 3 years ago
- Build your feature store with macros right within your dbt repository☆39Updated 3 years ago
- Automated Jupyter notebook testing. 📙☆41Updated last year
- Linear regression in SQL using dbt☆75Updated 3 weeks ago
- The easiest way to integrate Kedro and Great Expectations☆54Updated 3 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated last year
- A library to use `modal` as a backend for `joblib`.☆32Updated last year
- An abstraction layer for parameter tuning☆35Updated last month
- Example project for building scalable data pipelines with Kedro and Ibis.☆13Updated last month
- Machine Learning Projects with Flytekit☆36Updated 2 years ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆95Updated last year
- A python library bakeoff for medium sized datasets☆24Updated 2 years ago
- Playground for using large language models into the Modern Data Stack for entity matching☆108Updated 2 years ago
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background.☆36Updated 3 years ago
- A playground for running duckdb as a stateless query engine over a data lake☆217Updated 2 years ago
- Beneath is a serverless real-time data platform ⚡️☆84Updated 3 years ago
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more!☆68Updated last year
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documen…☆18Updated 2 years ago
- The ML-airport-configuration software is developed to provide a reference implementation to serve as a research example how to train and …☆28Updated 3 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Updated last year
- SciKIt-learn Pipeline in PAndas☆42Updated 2 years ago
- Assessing whether data from database complies with reference information.☆44Updated this week
- Experimental MLflow plugin for Google Cloud Vertex AI☆38Updated 7 months ago