iterative / datachainLinks
ETL, Analytics, Versioning for Unstructured Data
☆2,621Updated this week
Alternatives and similar repositories for datachain
Users that are interested in datachain are comparing it to the libraries listed below
Sorting:
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,762Updated 2 weeks ago
- A system for agentic LLM-powered data processing and ETL☆2,713Updated this week
- The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.☆3,501Updated last week
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,579Updated this week
- LLM abstractions that aren't obstructions☆1,248Updated this week
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,731Updated this week
- 🦾 Take control of your AI agents☆1,360Updated 3 months ago
- dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or o…☆1,858Updated this week
- LOTUS: fast and easy LLM-powered data processing☆1,261Updated last week
- Laminar - open-source all-in-one platform for engineering AI products. Create data flywheel for your AI app. Traces, Evals, Datasets, Lab…☆2,227Updated this week
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,400Updated this week
- Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and…☆2,236Updated 2 weeks ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,464Updated 3 months ago
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,296Updated last week
- Data transformation framework for AI. Ultra performant, with incremental processing.☆2,676Updated this week
- Things you can do with the token embeddings of an LLM☆1,445Updated 4 months ago
- Pixeltable — AI Data infrastructure providing a declarative, incremental approach for multimodal workloads.☆731Updated last week
- Create an issue on FireDucks☆920Updated 3 months ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆926Updated 6 months ago
- Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.☆964Updated this week
- The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)☆5,319Updated this week
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,634Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,054Updated 2 months ago
- Fast State-of-the-Art Static Embeddings☆1,801Updated this week
- Chat with any codebase in under two minutes | Fully local or via third-party APIs☆1,252Updated 9 months ago
- Chronon is a data platform for serving for AI/ML applications.☆835Updated last week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,375Updated 2 weeks ago
- Visual Data Preparation and Transformation. Low-Code Python-based ETL.☆1,091Updated 2 weeks ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,640Updated 3 months ago
- An open source DevOps tool for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI artifact.☆1,159Updated this week