Explore multimedia datasets at scale
☆1,062 · Dec 7, 2024 · Updated last year
Alternatives and similar repositories for kangas
Users that are interested in kangas are comparing it to the libraries listed below
- A recurrent neural network paired with heuristic methods that automatically infer geospatial, temporal and feature columns ☆184 · Mar 24, 2025 · Updated 11 months ago
- Open-source natural language enrichments at your fingertips. ☆462 · Jan 14, 2025 · Updated last year
- Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data … ☆11,346 · Jan 13, 2026 · Updated last month
- An open-source ML pipeline development platform ☆997 · Jan 9, 2025 · Updated last year
- Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two line… ☆669 · Feb 22, 2025 · Updated last year
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va… ☆3,982 · Dec 28, 2025 · Updated 2 months ago
- The simplest way to serve AI/ML models in production ☆1,122 · Updated this week
- ZenML: One AI Platform from Pipelines to Agents. https://zenml.io. ☆5,228 · Updated this week
- nannyml: post-deployment data science in python ☆2,126 · Jul 12, 2025 · Updated 7 months ago
- The fastest ⚡ way to build data pipelines. Develop iteratively, deploy anywhere. ☆3,624 · May 29, 2025 · Updated 9 months ago
- A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day ☆719 · Sep 13, 2023 · Updated 2 years ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,875 · Feb 23, 2026 · Updated last week
- A light-weight, flexible, and expressive statistical data testing library ☆4,212 · Feb 19, 2026 · Updated last week
- Create web apps from Python notebooks ☆4,295 · Feb 9, 2026 · Updated 3 weeks ago
- AI code-writing assistant that understands data content ☆2,288 · Feb 8, 2024 · Updated 2 years ago
- Online machine learning in Python ☆5,726 · Feb 9, 2026 · Updated 3 weeks ago
- Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Fro… ☆7,227 · Feb 24, 2026 · Updated last week
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store. ☆1,966 · Jul 3, 2025 · Updated 8 months ago
- Lumi is a nano framework to convert your python functions into a REST API without any extra headache. ☆625 · Dec 22, 2022 · Updated 3 years ago
- Containers for machine learning ☆9,252 · Updated this week
- Streamline scikit-learn model comparison. ☆143 · Dec 21, 2022 · Updated 3 years ago
- Lightning ⚡ fast forecasting with statistical and econometric models. ☆4,698 · Updated this week
- Automatically visualize your pandas dataframe via a single print! ☆5,371 · Mar 20, 2024 · Updated last year
- Aim: An easy-to-use & supercharged open-source experiment tracker. ☆6,017 · Updated this week
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,139 · Feb 21, 2026 · Updated last week
- Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stre… ☆9,012 · Feb 16, 2026 · Updated 2 weeks ago
- Represent, send, store and search multimodal data ☆3,115 · Jan 13, 2026 · Updated last month
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and… ☆10,771 · Updated this week
- fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quali… ☆1,833 · Feb 18, 2026 · Updated last week
- modelstore is a Python library that allows you to version, export, and save a machine learning model to your filesystem or a cloud sto… ☆401 · Feb 22, 2026 · Updated last week
- A Simple Bulk Labelling Tool ☆599 · Jul 29, 2025 · Updated 7 months ago
- Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and… ☆2,413 · Updated this week
- Interactively explore unstructured datasets from your dataframe. ☆1,250 · Updated this week
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more! ☆8,472 · Updated this week
- All-in-one AI framework for semantic search, LLM orchestration and language model workflows ☆12,210 · Feb 22, 2026 · Updated last week
- Build data pipelines, the easy way ☆4,139 · Jun 6, 2023 · Updated 2 years ago
- Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models ☆3,147 · Feb 6, 2026 · Updated 3 weeks ago
- An open-source, low-code machine learning library in Python ☆9,706 · Apr 21, 2025 · Updated 10 months ago
- Always know what to expect from your data. ☆11,197 · Updated this week