Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
β498Jul 23, 2025Updated 8 months ago
Alternatives and similar repositories for VARAG
Users that are interested in VARAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β356Jun 2, 2025Updated 10 months ago
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.β2,596Apr 6, 2026Updated last week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.β846Jan 28, 2025Updated last year
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.β270Mar 25, 2026Updated 3 weeks ago
- An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.β¦β713Nov 5, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installationβ156Oct 9, 2024Updated last year
- Parsing-free RAG supported by VLMsβ944Dec 7, 2025Updated 4 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.β235Oct 24, 2024Updated last year
- An open-source RAG-based tool for chatting with your documents.β25,288Apr 3, 2026Updated 2 weeks ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β88May 29, 2024Updated last year
- Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VLβ¦β629Jul 26, 2025Updated 8 months ago
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demoβ405Jun 26, 2025Updated 9 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.β123Oct 7, 2024Updated last year
- A new novel multi-modality (Vision) RAG architectureβ40Oct 1, 2024Updated last year
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Ingest, parse, and optimize any data format β‘οΈ from documents to multimedia β‘οΈ for enhanced compatibility with GenAI frameworksβ6,804Dec 12, 2025Updated 4 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. πβ1,908Jan 9, 2026Updated 3 months ago
- The easiest way to use Agentic RAG in any enterpriseβ4,428Jan 22, 2025Updated last year
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.β200Jul 22, 2024Updated last year
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your dataβ1,535May 20, 2025Updated 11 months ago
- Structured information extraction from documentsβ316Sep 26, 2024Updated last year
- A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.β1,447Feb 11, 2026Updated 2 months ago
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.reβ¦β46Nov 18, 2024Updated last year
- Mastering Applied AI, One Concept at a Timeβ2,178Feb 27, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has stβ¦β1,476Apr 30, 2025Updated 11 months ago
- A system for agentic LLM-powered data processing and ETLβ3,706Mar 27, 2026Updated 3 weeks ago
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviateβ7,646Jul 14, 2025Updated 9 months ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understandingβ2,373May 30, 2025Updated 10 months ago
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.β7,764Nov 7, 2025Updated 5 months ago
- A language model programming library.β5,873Jun 5, 2025Updated 10 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Modelβ8,109Feb 10, 2025Updated last year
- β869Mar 18, 2025Updated last year
- Empowering RAG with a memory-based data interface for all-purpose applications!β2,237Sep 11, 2025Updated 7 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- SearchGPT / Perplexity clone, but personalised for you.β1,319Jul 8, 2025Updated 9 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.β1,608Dec 20, 2025Updated 3 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ19,588Apr 10, 2026Updated last week
- π An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)β6,840Jul 4, 2025Updated 9 months ago
- An autoagentic AGI that is self-evolving and modular.β964Sep 4, 2024Updated last year
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VLβ2,668Updated this week
- An example of multi-agent orchestration with llama-indexβ445Jan 23, 2025Updated last year