ucbepic / docetl
A system for agentic LLM-powered data processing and ETL
☆3,501Updated 2 weeks ago
Alternatives and similar repositories for docetl
Users that are interested in docetl are comparing it to the libraries listed below
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,480Updated 5 months ago
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,750Updated this week
- A toolkit to create optimal Production-ready Retrieval Augmented Generation (RAG) setup for your data☆1,522Updated 8 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,446Updated 9 months ago
- AI-Powered Data Processing: Use LOTUS to process all of your datasets with LLMs and embeddings. Enjoy up to 1000x speedups with fast, acc…☆1,537Updated last week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,133Updated last week
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search.☆802Updated 2 weeks ago
- AdalFlow: The library to build & auto-optimize LLM applications.☆4,005Updated this week
- Knowledge Agents and Management in the Cloud☆4,231Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,932Updated 4 months ago
- Fast State-of-the-Art Static Embeddings☆1,990Updated last month
- UQLM (Uncertainty Quantification for Language Models) is a Python package for UQ-based LLM hallucination detection☆1,101Updated last week
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,464Updated 8 months ago
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,830Updated this week
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆933Updated last year
- Enterprise-grade and API-first LLM workspace for unstructured documents, including data extraction, redaction, rights management, prompt …☆1,139Updated last week
- This repository provides an advanced Retrieval-Augmented Generation (RAG) solution for complex question answering. It uses sophisticated …☆1,532Updated 7 months ago
- ContextGem: Effortless LLM extraction from documents☆1,762Updated last month
- This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.☆2,413Updated 11 months ago
- 🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines☆3,693Updated this week
- The Context Graph Factory for AI. Build, manage, and deploy AI-optimized Context Graphs.☆1,076Updated last week
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,476Updated 3 weeks ago
- A community-driven collection of RAG (Retrieval-Augmented Generation) frameworks, projects, and resources. Contribute and explore the evo…☆1,546Updated 2 weeks ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,592Updated last month
- Tool for generating high quality Synthetic datasets☆1,476Updated 3 months ago
- A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.☆1,097Updated this week
- Python library for Agentic Document Extraction from LandingAI☆2,343Updated last week
- Deploy your agentic workflows to production☆2,071Updated last week
- High-performance retrieval engine for unstructured data☆1,554Updated 2 months ago
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations!☆933Updated this week