Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
β669Feb 22, 2025Updated last year
Alternatives and similar repositories for lineapy
Users that are interested in lineapy are comparing it to the libraries listed below
Sorting:
- An open-source data logging library for machine learning models and data pipelines. π Provides visibility into data quality & model perfβ¦β2,800Jan 10, 2025Updated last year
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewβ¦β2,138Feb 21, 2026Updated last week
- The fastest β‘οΈ way to build data pipelines. Develop iteratively, deploy anywhere. βοΈβ3,624May 29, 2025Updated 9 months ago
- nannyml: post-deployment data science in pythonβ2,125Jul 12, 2025Updated 7 months ago
- Jupyter extensions that help you write code faster: Context aware AI Chat, Autocomplete, and Spreadsheetβ2,614Updated this week
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML vaβ¦β3,979Dec 28, 2025Updated 2 months ago
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to sβ¦β739Updated this week
- πΆ A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one dayπ€β719Sep 13, 2023Updated 2 years ago
- Automatically visualize your pandas dataframe via a single print! π π‘β5,371Mar 20, 2024Updated last year
- Streamline scikit-learn model comparison.β143Dec 21, 2022Updated 3 years ago
- A light-weight, flexible, and expressive statistical data testing libraryβ4,210Feb 19, 2026Updated last week
- A reactive Python kernel for Jupyter notebooks.β1,266Jan 16, 2026Updated last month
- ForML - A development framework and MLOps platform for the lifecycle management of data science projectsβ107May 29, 2023Updated 2 years ago
- π Online machine learning in Pythonβ5,726Feb 9, 2026Updated 2 weeks ago
- dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xlβ¦β1,606Feb 1, 2026Updated 3 weeks ago
- Evidently is ββan open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Froβ¦β7,227Updated this week
- Doubt your data, find bad labels.β517Jul 15, 2024Updated last year
- ZenML π: One AI Platform from Pipelines to Agents. https://zenml.io.β5,228Updated this week
- π¦ Explore multimedia datasets at scaleβ1,062Dec 7, 2024Updated last year
- the portable Python dataframe libraryβ6,404Feb 21, 2026Updated last week
- Lightning β‘οΈ fast forecasting with statistical and econometric models.β4,698Updated this week
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store.β1,966Jul 3, 2025Updated 7 months ago
- An open-source ML pipeline development platformβ997Jan 9, 2025Updated last year
- UnionML: the easiest way to build and deploy machine learning microservicesβ336Nov 6, 2023Updated 2 years ago
- Panel: The powerful data exploration & web app framework for Pythonβ5,606Updated this week
- Interpretable ML package π for concise, transparent, and accurate predictive modeling (sklearn-compatible).β1,573Feb 4, 2026Updated 3 weeks ago
- Modin: Scale your Pandas workflows by changing a single line of codeβ10,362Feb 10, 2026Updated 2 weeks ago
- Create web apps from Python notebooksβ4,295Feb 9, 2026Updated 2 weeks ago
- Machine learning with dataframesβ1,568Updated this week
- Build, Manage and Deploy AI/ML Systemsβ9,863Updated this week
- π Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Modelsβ3,147Feb 6, 2026Updated 3 weeks ago
- Always know what to expect from your data.β11,162Feb 20, 2026Updated last week
- Extra blocks for scikit-learn pipelines.β1,379Feb 12, 2026Updated 2 weeks ago
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering andβ¦β10,768Updated this week
- Coarse-grained lineage and tracing for machine learning pipelines.β471Nov 11, 2022Updated 3 years ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per sβ¦β8,475Feb 5, 2026Updated 3 weeks ago
- dstack is an open-source control plane for running development, training, and inference jobs on GPUsβacross hyperscalers, neoclouds, or oβ¦β2,049Updated this week
- Represent, send, store and search multimodal data