datachain-ai / datachainLinks
Analytics, Versioning and ETL for multimodal data: video, audio, PDFs, images
☆2,719Updated this week
Alternatives and similar repositories for datachain
Users that are interested in datachain are comparing it to the libraries listed below
Sorting:
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,889Updated last week
- A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.☆3,784Updated 2 weeks ago
- 🦾 Take control of your AI agents☆1,388Updated 5 months ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,817Updated this week
- dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or o…☆2,015Updated this week
- Laminar - open-source observability platform purpose-built for AI agents. YC S24.☆2,539Updated this week
- AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, acc…☆1,533Updated last week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,520Updated 8 months ago
- Build. Observe. Iterate. Ship.☆1,347Updated this week
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,912Updated 4 months ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,998Updated 3 weeks ago
- A system for agentic LLM-powered data processing and ETL☆3,416Updated last week
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python…☆1,460Updated 11 months ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,480Updated 4 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,639Updated 2 weeks ago
- Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured…☆1,483Updated last month
- Things you can do with the token embeddings of an LLM☆1,450Updated last month
- visual data prep powered by python☆1,296Updated last week
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,743Updated this week
- Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme…☆2,168Updated this week
- The modern replacement for Jupyter Notebooks☆2,178Updated last year
- AI-powered Jupyter Notebook. Use AI to generate and edit code cells, automatically fix errors, and chat with your data☆1,099Updated last month
- Chat with any codebase in under two minutes | Fully local or via third-party APIs☆1,262Updated last year
- Fast State-of-the-Art Static Embeddings☆1,986Updated 3 weeks ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆933Updated 11 months ago
- Create an issue on FireDucks☆933Updated 8 months ago
- Chronon is a data platform for serving for AI/ML applications.☆959Updated this week
- Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and…☆2,372Updated 2 weeks ago
- Interact with your SQL database, Natural Language to SQL using LLMs☆3,602Updated last year
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,133Updated this week