the GPU-native, sandboxed Postgres for AI agents
β9,040Feb 16, 2026Updated last month
Alternatives and similar repositories for deeplake
Users that are interested in deeplake are comparing it to the libraries listed below
Sorting:
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.β30,926Mar 10, 2026Updated last week
- Aim π« β An easy-to-use & supercharged open-source experiment tracker.β6,033Updated this week
- ZenML π: One AI Platform from Pipelines to Agents. https://zenml.io.β5,281Updated this week
- The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!β8,520Updated this week
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and aβ¦β24,519Updated this week
- βοΈ Build multimodal AI applications with cloud-native stackβ21,849Mar 24, 2025Updated 11 months ago
- The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, aβ¦β24,730Updated this week
- π¦ Data Versioning and ML Experimentsβ15,448Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.β41,773Updated this week
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β42,053Updated this week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β41,807Updated this week
- Build, Manage and Deploy AI/ML Systemsβ9,956Updated this week
- Tensors and Dynamic neural networks in Python with strong GPU accelerationβ98,243Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and moreβ35,108Updated this week
- ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling β¦β6,567Updated this week
- Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data β¦β11,368Jan 13, 2026Updated 2 months ago
- π‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflowsβ12,291Updated this week
- A library for efficient similarity search and clustering of dense vectors.β39,403Updated this week
- A data augmentations library for audio, image, text, and video.β5,070Updated this week
- LlamaIndex is the leading document agent and OCR platformβ47,753Updated this week
- π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal modelβ¦β158,060Updated this week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the clβ¦β29,611Updated this week
- Streamlit β A faster way to build and share data apps.β43,928Updated this week
- π€ The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation toolsβ21,289Updated this week
- βΎοΈ CML - Continuous Machine Learning | CI/CD for MLβ4,170Jun 2, 2025Updated 9 months ago
- Low-code framework for building custom LLMs, neural networks, and other AI modelsβ11,657Updated this week
- Label Studio is a multi-type data labeling and annotation tool with standardized output formatβ26,726Updated this week
- Milvus is a high-performance, cloud-native vector database built for scalable vector ANN searchβ43,296Updated this week
- Evidently is ββan open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. Froβ¦β7,308Mar 10, 2026Updated last week
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with strucβ¦β15,795Updated this week
- π Geometric Computer Vision Library for Spatial AIβ11,121Updated this week
- Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering andβ¦β10,790Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.β21,856Updated this week
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML vaβ¦β3,990Dec 28, 2025Updated 2 months ago
- A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learningβ20,255Mar 5, 2026Updated 2 weeks ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β32,191Sep 30, 2025Updated 5 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasetsβ4,896Updated this week
- An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model cβ¦β14,344Jul 3, 2024Updated last year
- Google Researchβ37,452Mar 12, 2026Updated last week