iterative / datachainLinks
ETL, Analytics, Versioning for Unstructured Data
☆2,584Updated this week
Alternatives and similar repositories for datachain
Users that are interested in datachain are comparing it to the libraries listed below
Sorting:
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,685Updated this week
- A system for agentic LLM-powered data processing and ETL☆2,273Updated last week
- Create an issue on FireDucks☆917Updated 3 weeks ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆3,328Updated 2 months ago
- Things you can do with the token embeddings of an LLM☆1,444Updated 2 months ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,586Updated 3 weeks ago
- LOTUS: A semantic query engine for fast and easy LLM-powered data processing☆1,199Updated 3 weeks ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,064Updated this week
- A portable accelerated data query and LLM-inference engine, written in Rust, for data-grounded AI apps and agents.☆2,460Updated this week
- Fast State-of-the-Art Static Embeddings☆1,740Updated 2 weeks ago
- The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.☆3,298Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,015Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,136Updated last month
- RAG that intelligently adapts to your use case, data, and queries☆3,320Updated 2 months ago
- The modern replacement for Jupyter Notebooks☆2,134Updated 6 months ago
- Superlinked is a Python framework for AI Engineers building high-performance search & recommendation applications that combine structured…☆1,164Updated this week
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆1,964Updated this week
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆921Updated 4 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,024Updated last month
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,281Updated last week
- A complement to pgvector for high performance, cost efficient vector search on large workloads.☆2,049Updated this week
- Korvus is a search SDK that unifies the entire RAG pipeline in a single database query. Built on top of Postgres with bindings for Python…☆1,372Updated 4 months ago
- Improved file parsing for LLM’s☆3,002Updated 7 months ago
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,088Updated this week
- Chronon is a data platform for serving for AI/ML applications.☆817Updated this week
- Implementing the 4 agentic patterns from scratch☆1,375Updated 3 months ago
- dstack is an open-source alternative to Kubernetes and Slurm, designed to simplify GPU allocation and AI workload orchestration for ML te…☆1,802Updated this week
- This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.☆2,053Updated 4 months ago
- Knowledge Agents and Management in the Cloud☆4,014Updated this week
- ZenML 🙏: The bridge between ML and Ops. https://zenml.io.☆4,637Updated this week