Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
β499Jul 23, 2025Updated 10 months ago
Alternatives and similar repositories for VARAG
Users that are interested in VARAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β356Jun 2, 2025Updated 11 months ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.β2,637May 19, 2026Updated last week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.β847Jan 28, 2025Updated last year
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.β272Mar 25, 2026Updated 2 months ago
- An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.β¦β718Nov 5, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installationβ154Oct 9, 2024Updated last year
- Parsing-free RAG supported by VLMsβ956Dec 7, 2025Updated 5 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.β235Oct 24, 2024Updated last year
- An open-source RAG-based tool for chatting with your documents.β25,394Apr 3, 2026Updated last month
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β88May 29, 2024Updated 2 years ago
- Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VLβ¦β631Jul 26, 2025Updated 10 months ago
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demoβ405Jun 26, 2025Updated 11 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.β130Oct 7, 2024Updated last year
- A new novel multi-modality (Vision) RAG architectureβ39Oct 1, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Ingest, parse, and optimize any data format β‘οΈ from documents to multimedia β‘οΈ for enhanced compatibility with GenAI frameworksβ6,820Dec 12, 2025Updated 5 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. πβ1,919Jan 9, 2026Updated 4 months ago
- The easiest way to use Agentic RAG in any enterpriseβ4,437Jan 22, 2025Updated last year
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.β199Jul 22, 2024Updated last year
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your dataβ1,534May 20, 2025Updated last year
- Structured information extraction from documentsβ317May 3, 2026Updated 3 weeks ago
- A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.β1,452Feb 11, 2026Updated 3 months ago
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.reβ¦β46Nov 18, 2024Updated last year
- Mastering Applied AI, One Concept at a Timeβ2,209Feb 27, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has stβ¦β1,476Apr 30, 2025Updated last year
- A system for agentic LLM-powered data processing and ETLβ3,754May 20, 2026Updated last week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviateβ7,710May 11, 2026Updated 2 weeks ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understandingβ2,406May 30, 2025Updated last year
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.β7,852Nov 7, 2025Updated 6 months ago
- A language model programming library.β5,870Jun 5, 2025Updated 11 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Modelβ8,127Feb 10, 2025Updated last year
- β867Mar 18, 2025Updated last year
- Empowering RAG with a memory-based data interface for all-purpose applications!β2,243Sep 11, 2025Updated 8 months ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- SearchGPT / Perplexity clone, but personalised for you.β1,320Jul 8, 2025Updated 10 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.β1,615Dec 20, 2025Updated 5 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ19,787Updated this week
- π An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)β6,860Jul 4, 2025Updated 10 months ago
- An autoagentic AGI that is self-evolving and modular.β969Sep 4, 2024Updated last year
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VLβ2,674May 18, 2026Updated last week
- An example of multi-agent orchestration with llama-indexβ443Jan 23, 2025Updated last year