LineaLabs / lineapy
Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
☆667Updated 2 months ago
Alternatives and similar repositories for lineapy
Users that are interested in lineapy are comparing it to the libraries listed below
Sorting:
- Coarse-grained lineage and tracing for machine learning pipelines.☆470Updated 2 years ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,078Updated last month
- 🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞☆720Updated last year
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆500Updated 3 months ago
- 🏬 modelstore is a Python library that allows you to version, export, and save a machine learning model to your filesystem or a cloud sto…☆389Updated 4 months ago
- skops is a Python library helping you share your scikit-learn based models and put them in production☆477Updated 3 weeks ago
- An end-to-end implementation of intent prediction with Metaflow and other cool tools☆859Updated last year
- UnionML: the easiest way to build and deploy machine learning microservices☆335Updated last year
- ☆704Updated 2 years ago
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton☆861Updated last year
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging)☆215Updated 2 months ago
- ML pipeline orchestration and model deployments on Kubernetes.☆435Updated last year
- Automatically profile dataframes in the Jupyter sidebar☆345Updated last year
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆699Updated last month
- Data Quality assessment with one line of code☆442Updated last week
- ☁️ Terraform plugin for machine learning workloads: spot instance recovery & auto-termination | AWS, GCP, Azure, Kubernetes☆294Updated 5 months ago
- Extra blocks for scikit-learn pipelines.☆1,329Updated 2 weeks ago
- skimpy is a light weight tool that provides summary statistics about variables in data frames within the console.☆458Updated 2 weeks ago
- Doubt your data, find bad labels.☆512Updated 10 months ago
- Break the linear presentation of Jupyter Notebooks with sticky cells!☆567Updated last year
- Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc☆387Updated 10 months ago
- Joining the modern data stack with the modern ML stack☆197Updated last year
- Lightweight and extensible compatibility layer between dataframe libraries!☆981Updated this week
- Machine learning with dataframes☆1,385Updated last week
- just a bunch of useful embeddings for scikit-learn pipelines☆497Updated last month
- A series of Terraform based recipes to provision popular MLOps stacks on the cloud.☆255Updated 7 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆113Updated last year
- ⚓ Eurybia monitors model drift over time and securizes model deployment with data validation☆210Updated 6 months ago
- The Fuzzy Labs guide to the universe of open source MLOps☆461Updated 9 months ago
- Toolkit for developing and maintaining ML models☆154Updated 11 months ago