comet-ml / kangas
Explore multimedia datasets at scale
☆1,057 Updated 4 months ago
Alternatives and similar repositories for kangas:
Users who are interested in kangas are comparing it to the libraries listed below.
- A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day ☆720 Updated last year
- The simplest way to serve AI/ML models in production ☆981 Updated this week
- nannyml: post-deployment data science in python ☆2,057 Updated last week
- Open-source natural language enrichments at your fingertips. ☆458 Updated 3 months ago
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s… ☆698 Updated last month
- Curated list of open source tooling for data-centric AI on unstructured data. ☆719 Updated last year
- An open-source ML pipeline development platform ☆990 Updated 3 months ago
- Interactively explore unstructured datasets from your dataframe. ☆1,170 Updated 2 months ago
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☆3,563 Updated 7 months ago
- Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax. ☆963 Updated 2 months ago
- Temporian is an open-source Python library for preprocessing ⚡ and feature engineering temporal data for machine learning applicati… ☆692 Updated 9 months ago
- Break the linear presentation of Jupyter Notebooks with sticky cells! ☆567 Updated last year
- ML pipeline orchestration and model deployments on Kubernetes. ☆435 Updated last year
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,077 Updated last month
- just a bunch of useful embeddings for scikit-learn pipelines ☆497 Updated last month
- The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact… ☆1,436 Updated 4 months ago
- Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two line… ☆666 Updated 2 months ago
- AI code-writing assistant that understands data content ☆2,255 Updated last year
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va… ☆3,784 Updated 2 months ago
- aim-mlflow integration ☆210 Updated last year
- A reactive Python kernel for Jupyter notebooks. ☆1,218 Updated 2 weeks ago
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store. ☆1,888 Updated this week
- A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way ☆281 Updated last week
- Open Source Data Annotation & Labeling Tools ☆586 Updated 4 months ago
- Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Gra… ☆1,799 Updated 10 months ago
- Doubt your data, find bad labels. ☆511 Updated 9 months ago
- Blazing fast framework for fine-tuning similarity learning models ☆657 Updated 3 weeks ago
- A Simple Bulk Labelling Tool ☆576 Updated 4 months ago
- Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc ☆387 Updated 10 months ago
- Quickly annotate data from the comfort of your Jupyter notebook ☆785 Updated last year