datachain-ai / datachainLinks
Analytics, Versioning and ETL for multimodal data: video, audio, PDFs, images
☆2,713Updated this week
Alternatives and similar repositories for datachain
Users that are interested in datachain are comparing it to the libraries listed below
Sorting:
- Build custom inference engines for models, agents, multi-modal systems, RAG, pipelines and more.☆3,728Updated last week
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,864Updated last week
- 🦾 Take control of your AI agents☆1,387Updated 3 months ago
- LLM abstractions that aren't obstructions☆1,322Updated this week
- AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, acc…☆1,483Updated last week
- dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or o…☆1,976Updated this week
- Visual Data Preparation and Transformation. Low-Code Python-based ETL.☆1,291Updated last week
- A system for agentic LLM-powered data processing and ETL☆3,251Updated 2 weeks ago
- Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and…☆2,329Updated last week
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,916Updated this week
- Create an issue on FireDucks☆932Updated 6 months ago
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python…☆1,454Updated 10 months ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,783Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,561Updated this week
- Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.☆1,029Updated this week
- Chat with any codebase in under two minutes | Fully local or via third-party APIs☆1,263Updated last year
- Things you can do with the token embeddings of an LLM☆1,450Updated 2 weeks ago
- A realtime serving engine for Data-Intensive Generative AI Applications☆1,080Updated this week
- Chronon is a data platform for serving for AI/ML applications.☆946Updated this week
- Fast State-of-the-Art Static Embeddings☆1,948Updated last month
- Laminar - open-source all-in-one platform for engineering AI products. Create data flywheel for your AI app. Traces, Evals, Datasets, Lab…☆2,472Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,776Updated last week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,692Updated last week
- Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured…☆1,452Updated last month
- Concurrent Python made simple☆1,512Updated 10 months ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,672Updated this week
- AI-powered Jupyter Notebook — use local AI to generate and edit code cells, automatically fix errors, and chat with your data☆1,096Updated 11 months ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,457Updated 3 months ago
- Seamlessly integrate LLMs as Python functions☆2,384Updated 3 weeks ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,518Updated 6 months ago