datachain-ai / datachainLinks
Analytics, Versioning and ETL for multimodal data: video, audio, PDFs, images
☆2,719Updated this week
Alternatives and similar repositories for datachain
Users that are interested in datachain are comparing it to the libraries listed below
Sorting:
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,908Updated last week
- A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.☆3,797Updated last week
- 🦾 Take control of your AI agents☆1,389Updated 5 months ago
- The LLM Anti-Framework☆1,404Updated this week
- AI-powered Jupyter Notebook. Use AI to generate and edit code cells, automatically fix errors, and chat with your data☆1,099Updated last month
- AdalFlow: The library to build & auto-optimize LLM applications.☆4,010Updated last week
- AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, acc…☆1,545Updated last week
- A system for agentic LLM-powered data processing and ETL☆3,525Updated last week
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python…☆1,458Updated last year
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,924Updated 5 months ago
- Chat with any codebase in under two minutes | Fully local or via third-party APIs☆1,263Updated last year
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,840Updated this week
- Chronon is a data platform for serving for AI/ML applications.☆964Updated this week
- Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and…☆2,387Updated this week
- The modern replacement for Jupyter Notebooks☆2,182Updated last year
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆933Updated last year
- dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or o…☆2,022Updated this week
- Open-source platform for extracting structured data from documents using AI.☆1,464Updated 8 months ago
- visual data prep powered by python☆1,345Updated this week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,525Updated 8 months ago
- Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.☆1,050Updated this week
- Laminar - open-source observability platform purpose-built for AI agents. YC S24.☆2,566Updated this week
- Things you can do with the token embeddings of an LLM☆1,452Updated 2 months ago
- Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured…☆1,489Updated last month
- Fast State-of-the-Art Static Embeddings☆1,992Updated last month
- An SDK for working with LLMs and AI Agents from Apache Airflow, based on Pydantic AI☆520Updated 4 months ago
- A realtime serving engine for Data-Intensive Generative AI Applications☆1,095Updated this week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,483Updated 5 months ago
- Postgres extension for vector search (DiskANN), complements pgvector for performance and scale. Postgres OSS licensed.☆2,815Updated last month
- Create an issue on FireDucks☆932Updated 8 months ago