iterative / datachain
ETL, Analytics, Versioning for Unstructured Data
☆2,555Updated this week
Alternatives and similar repositories for datachain
Users that are interested in datachain are comparing it to the libraries listed below
Sorting:
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,665Updated this week
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,601Updated this week
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your data☆1,421Updated this week
- Things you can do with the token embeddings of an LLM☆1,441Updated last month
- Deploy high-performance AI models and inference pipelines on FastAPI with built-in batching, streaming and more.☆3,106Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,031Updated last month
- A system for agentic LLM-powered data processing and ETL☆1,955Updated this week
- LOTUS: A semantic query engine for fast and easy LLM-powered data processing☆1,175Updated this week
- 🦾 Take control of your AI agents☆1,291Updated 2 weeks ago
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python…☆1,359Updated 3 months ago
- LLM abstractions that aren't obstructions☆1,111Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,442Updated 3 months ago
- data load tool (dlt) is an open source Python library that makes data loading easy 🛠️☆3,627Updated this week
- ☆2,939Updated 8 months ago
- Concurrent Python made simple☆1,369Updated 3 months ago
- This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.☆1,902Updated 3 months ago
- dstack is an open-source alternative to Kubernetes and Slurm, designed to simplify GPU allocation and AI workload orchestration for ML te…☆1,787Updated this week
- Interact with your SQL database, Natural Language to SQL using LLMs☆3,494Updated 9 months ago
- Easy token price estimates for 400+ LLMs. TokenOps.☆1,658Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding☆2,057Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,060Updated 2 months ago
- Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme…☆1,541Updated this week
- Visual Data Preparation and Transformation. Low-Code Python-based ETL.☆1,056Updated last week
- Python Stream Processing☆1,738Updated last month
- Implementing the 4 agentic patterns from scratch☆1,295Updated 2 months ago
- Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metada…☆2,130Updated last week
- Deploy your agentic worfklows to production☆2,004Updated this week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,247Updated this week
- Laminar - open-source all-in-one platform for engineering AI products. Create data flywheel for your AI app. Traces, Evals, Datasets, Lab…☆1,979Updated this week
- Visualize decision trees in Python☆496Updated 2 months ago