LineaLabs / lineapy
Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
☆665Updated this week
Alternatives and similar repositories for lineapy:
Users that are interested in lineapy are comparing it to the libraries listed below
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,040Updated 4 months ago
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆498Updated 3 weeks ago
- 🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞☆718Updated last year
- UnionML: the easiest way to build and deploy machine learning microservices☆335Updated last year
- Coarse-grained lineage and tracing for machine learning pipelines.☆467Updated 2 years ago
- 🏬 modelstore is a Python library that allows you to version, export, and save a machine learning model to your filesystem or a cloud sto…☆383Updated last month
- skops is a Python library helping you share your scikit-learn based models and put them in production☆465Updated 2 weeks ago
- Data Quality assessment with one line of code☆434Updated 3 weeks ago
- Doubt your data, find bad labels.☆508Updated 7 months ago
- Break the linear presentation of Jupyter Notebooks with sticky cells!☆561Updated last year
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton☆861Updated last year
- Lightweight and extensible compatibility layer between dataframe libraries!☆818Updated this week
- skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.☆427Updated 2 weeks ago
- Creates dynamic html report from jupyter notebook.☆306Updated 2 months ago
- Natural Intelligence is still a pretty good idea.☆801Updated 7 months ago
- Extra blocks for scikit-learn pipelines.☆1,304Updated 3 weeks ago
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆691Updated last month
- Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc☆385Updated 7 months ago
- ML pipeline orchestration and model deployments on Kubernetes.☆437Updated last year
- Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of p…☆324Updated this week
- The Fuzzy Labs guide to the universe of open source MLOps☆454Updated 7 months ago
- EvalML is an AutoML library written in python.☆801Updated this week
- Prepping tables for machine learning☆1,297Updated this week
- Distributed XGBoost on Ray☆147Updated 7 months ago
- Better SQL in Jupyter. 📊☆745Updated last week
- ☁️ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetes☆292Updated 2 months ago
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging)☆207Updated this week
- Recommendations at "Reasonable Scale": joining dataOps with recSys through dbt, Merlin and Metaflow☆234Updated last year
- ☆703Updated 2 years ago
- An end-to-end implementation of intent prediction with Metaflow and other cool tools☆856Updated last year