iterative / datachainLinks
ETL, Analytics, Versioning for Unstructured Data
☆2,677Updated this week
Alternatives and similar repositories for datachain
Users that are interested in datachain are comparing it to the libraries listed below
Sorting:
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,805Updated this week
- A system for agentic LLM-powered data processing and ETL☆2,945Updated 2 weeks ago
- LLM abstractions that aren't obstructions☆1,272Updated this week
- dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or o…☆1,913Updated this week
- Visual Data Preparation and Transformation. Low-Code Python-based ETL.☆1,107Updated last week
- 🦾 Take control of your AI agents☆1,381Updated last month
- The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.☆3,578Updated 3 weeks ago
- Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, accurate query processing, that…☆1,311Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,788Updated last week
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python…☆1,451Updated 8 months ago
- The modern replacement for Jupyter Notebooks☆2,166Updated 10 months ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,745Updated this week
- Laminar - open-source all-in-one platform for engineering AI products. Create data flywheel for your AI app. Traces, Evals, Datasets, Lab…☆2,317Updated last week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,506Updated 4 months ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,534Updated this week
- Distributed query engine providing simple and reliable data processing for any modality and scale☆4,546Updated this week
- Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and…☆2,274Updated last week
- Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured…☆1,387Updated 2 weeks ago
- Things you can do with the token embeddings of an LLM☆1,448Updated 6 months ago
- AI-powered Jupyter Notebook — use local AI to generate and edit code cells, automatically fix errors, and chat with your data☆1,096Updated 9 months ago
- Chronon is a data platform for serving for AI/ML applications.☆920Updated last week
- An open source DevOps tool from the CNCF for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI Artifac…☆1,203Updated this week
- Fast State-of-the-Art Static Embeddings☆1,853Updated this week
- This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.☆2,270Updated 7 months ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,810Updated last month
- RAG that intelligently adapts to your use case, data, and queries☆3,535Updated 3 months ago
- Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.☆1,000Updated this week
- Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme…☆1,909Updated this week
- Chat with any codebase in under two minutes | Fully local or via third-party APIs☆1,257Updated 10 months ago
- Python Stream Processing☆1,853Updated 6 months ago