Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
☆669Feb 22, 2025Updated last year
Alternatives and similar repositories for lineapy
Users that are interested in lineapy are comparing it to the libraries listed below
Sorting:
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew…☆2,142Mar 12, 2026Updated last week
- An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model perf…☆2,802Jan 10, 2025Updated last year
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,623May 29, 2025Updated 9 months ago
- Jupyter extensions that help you write code faster: Context aware AI Chat, Autocomplete, and Spreadsheet☆2,618Mar 13, 2026Updated last week
- nannyml: post-deployment data science in python☆2,132Jul 12, 2025Updated 8 months ago
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…☆3,994Dec 28, 2025Updated 2 months ago
- A reactive Python kernel for Jupyter notebooks.☆1,265Jan 16, 2026Updated 2 months ago
- dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xl…☆1,606Mar 1, 2026Updated 2 weeks ago
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆740Updated this week
- Streamline scikit-learn model comparison.☆143Dec 21, 2022Updated 3 years ago
- A light-weight, flexible, and expressive statistical data testing library☆4,259Updated this week
- Automatically visualize your pandas dataframe via a single print! 📊 💡☆5,370Mar 20, 2024Updated 2 years ago
- 🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞☆718Sep 13, 2023Updated 2 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆106May 29, 2023Updated 2 years ago
- Coarse-grained lineage and tracing for machine learning pipelines.☆471Nov 11, 2022Updated 3 years ago
- the portable Python dataframe library☆6,457Updated this week
- Lightning ⚡️ fast forecasting with statistical and econometric models.☆4,721Updated this week
- Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Fro…☆7,308Mar 10, 2026Updated last week
- 🌊 Online machine learning in Python☆5,752Mar 9, 2026Updated last week
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store.☆1,967Jul 3, 2025Updated 8 months ago
- ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.☆5,281Updated this week
- Doubt your data, find bad labels.☆516Jul 15, 2024Updated last year
- Aqueduct is no longer being maintained. Aqueduct allows you to run LLM and ML workloads on any cloud infrastructure.☆520Jun 7, 2023Updated 2 years ago
- 🦘 Explore multimedia datasets at scale☆1,063Dec 7, 2024Updated last year
- Create web apps from Python notebooks☆4,300Feb 9, 2026Updated last month
- An open-source ML pipeline development platform☆998Jan 9, 2025Updated last year
- Modin: Scale your Pandas workflows by changing a single line of code☆10,363Feb 10, 2026Updated last month
- Build, Manage and Deploy AI/ML Systems☆9,956Updated this week
- Always know what to expect from your data.☆11,257Mar 14, 2026Updated last week
- Panel: The powerful data exploration & web app framework for Python☆5,627Mar 14, 2026Updated last week
- Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).☆1,573Feb 24, 2026Updated 3 weeks ago
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and…☆10,790Mar 13, 2026Updated last week
- Extra blocks for scikit-learn pipelines.☆1,383Mar 12, 2026Updated last week
- Feathr – A scalable, unified data and AI engineering platform for enterprise☆1,927Apr 4, 2024Updated last year
- dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or o…☆2,065Updated this week
- Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.☆1,168Oct 4, 2025Updated 5 months ago
- Machine learning with dataframes☆1,579Mar 13, 2026Updated last week
- UnionML: the easiest way to build and deploy machine learning microservices☆336Nov 6, 2023Updated 2 years ago
- Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.☆6,033Updated this week