Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
★497 · Jul 23, 2025 · Updated 10 months ago
Alternatives and similar repositories for VARAG
Users that are interested in VARAG are comparing it to the libraries listed below.
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. ★355 · Jun 2, 2025 · Updated last year
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol. ★2,668 · Jun 10, 2026 · Updated last week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ★850 · Jan 28, 2025 · Updated last year
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ★276 · Mar 25, 2026 · Updated 2 months ago
- An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.… ★722 · Nov 5, 2024 · Updated last year
- Uses the latest GraphRAG interface, with a local Ollama instance providing the LLM interface. Supports installation via pip. ★154 · Oct 9, 2024 · Updated last year
- Parsing-free RAG supported by VLMs ★964 · Dec 7, 2025 · Updated 6 months ago
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast. ★235 · Oct 24, 2024 · Updated last year
- An open-source RAG-based tool for chatting with your documents. ★25,467 · Jun 9, 2026 · Updated last week
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ★88 · May 29, 2024 · Updated 2 years ago
- Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VL… ★631 · Jul 26, 2025 · Updated 10 months ago
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo ★402 · Jun 26, 2025 · Updated 11 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment. ★130 · Oct 7, 2024 · Updated last year
- A new novel multi-modality (Vision) RAG architecture ★39 · Oct 1, 2024 · Updated last year
- Ingest, parse, and optimize any data format, from documents to multimedia, for enhanced compatibility with GenAI frameworks ★7,611 · Dec 12, 2025 · Updated 6 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. ★1,943 · May 26, 2026 · Updated 3 weeks ago
- The easiest way to use Agentic RAG in any enterprise ★4,440 · Jan 22, 2025 · Updated last year
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model. ★199 · Jul 22, 2024 · Updated last year
- A toolkit to create optimal Production-ready Retrieval Augmented Generation (RAG) setup for your data ★1,535 · May 20, 2025 · Updated last year
- Structured information extraction from documents ★316 · May 3, 2026 · Updated last month
- A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings. ★1,453 · Feb 11, 2026 · Updated 4 months ago
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re… ★46 · Nov 18, 2024 · Updated last year
- Mastering Applied AI, One Concept at a Time ★2,214 · Feb 27, 2026 · Updated 3 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ★1,478 · Apr 30, 2025 · Updated last year
- A system for agentic LLM-powered data processing and ETL ★3,835 · Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate ★7,714 · Jun 8, 2026 · Updated last week
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding ★2,408 · May 30, 2025 · Updated last year
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API. ★7,887 · Nov 7, 2025 · Updated 7 months ago
- A language model programming library. ★5,873 · Jun 5, 2025 · Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ★8,139 · Feb 10, 2025 · Updated last year
- ★860 · Mar 18, 2025 · Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages ★20,840 · Updated this week
- Empowering RAG with a memory-based data interface for all-purpose applications! ★2,245 · Sep 11, 2025 · Updated 9 months ago
- SearchGPT / Perplexity clone, but personalised for you. ★1,320 · Jul 8, 2025 · Updated 11 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ★1,621 · Dec 20, 2025 · Updated 5 months ago
- An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) ★6,873 · Jul 4, 2025 · Updated 11 months ago
- An autoagentic AGI that is self-evolving and modular. ★971 · Sep 4, 2024 · Updated last year
- Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ★2,679 · Jun 8, 2026 · Updated last week
- An example of multi-agent orchestration with llama-index ★444 · Jan 23, 2025 · Updated last year