tjmlabs / ColiVaraLinks
Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has state of the art retrieval performance on both text and visual documents. using vision models instead of chunking and text-processing for documents. No OCR, no text extraction, no broken tables, or missing imagesβ¦
β1,448Updated 9 months ago
Alternatives and similar repositories for ColiVara
Users that are interested in ColiVara are comparing it to the libraries listed below
Sorting:
- π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQLβ1,140Updated this week
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search.β801Updated last week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.β1,483Updated 5 months ago
- Python package and backend for the Elysia platform app.β1,876Updated last week
- ContextGem: Effortless LLM extraction from documentsβ1,777Updated last month
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demoβ405Updated 7 months ago
- Deep research agent to help you find the best GitHub repositories π΅οΈ!β842Updated 2 months ago
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) intβ¦β749Updated 11 months ago
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and versβ¦β996Updated this week
- A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.β1,101Updated last week
- Graph powered context harness for AI agentsβ1,161Updated this week
- xpander.ai is the runtime and control plane to build, run, and ship reliable AI agents fast and anywhereβ856Updated 3 months ago
- Fast State-of-the-Art Static Embeddingsβ1,992Updated last month
- For your multi-agent needsβ1,373Updated 2 months ago
- Generic rag framework to apply the power of LLMs on any given datasetβ668Updated last month
- Make any LLM to think like OpenAI o1 and deepseek R1β492Updated last year
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your dataβ1,525Updated 8 months ago
- A system for agentic LLM-powered data processing and ETLβ3,525Updated last week
- β452Updated 5 months ago
- Doctor is a tool for discovering, crawl, and indexing web sites to be exposed as an MCP server for LLM agents.β462Updated 8 months ago
- The platform for LLM evaluations and AI agent testingβ2,813Updated this week
- β418Updated last year
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAGβ1,464Updated 8 months ago
- Semantic Chunker is a lightweight Python package for semantically-aware chunking and clustering of text.β292Updated 9 months ago
- An open-source, no-code agent building platform.β1,830Updated 2 months ago
- π¦ CHONK docs with Chonkie β¨ β The lightweight ingestion library for fast, efficient and robust RAG pipelinesβ3,727Updated this week
- π₯ An inbox UX for interacting with human-in-the-loop agents.β918Updated 2 weeks ago
- Tensorlake is a Document Ingestion API and a serverless platform for building data processing and orchestration APIsβ878Updated this week
- Reasoning Augmented Generationβ895Updated 6 months ago
- Legacy Python library for Agentic Document Extraction (ADE). Use the landingai-ade library for all new projects.β2,354Updated this week