adithya-s-k / VARAGView external linksLinks
Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
β493Jul 23, 2025Updated 6 months ago
Alternatives and similar repositories for VARAG
Users that are interested in VARAG are comparing it to the libraries listed below
Sorting:
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β352Jun 2, 2025Updated 8 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.β843Jan 28, 2025Updated last year
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.β2,503Feb 3, 2026Updated last week
- Parsing-free RAG supported by VLMsβ910Dec 7, 2025Updated 2 months ago
- An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.β¦β695Nov 5, 2024Updated last year
- A NextJS/Langflow based app that takes a PDF and converts it into a podcast.β235Oct 24, 2024Updated last year
- Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VLβ¦β625Jul 26, 2025Updated 6 months ago
- An open-source RAG-based tool for chatting with your documents.β25,019Jul 4, 2025Updated 7 months ago
- The latest graphrag interface is used, using the local ollama to provide the LLM interface.Support for using the pip installationβ156Oct 9, 2024Updated last year
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.β259Jan 21, 2026Updated 3 weeks ago
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demoβ405Jun 26, 2025Updated 7 months ago
- The easiest way to use Agentic RAG in any enterpriseβ4,398Jan 22, 2025Updated last year
- A system for agentic LLM-powered data processing and ETLβ3,557Feb 2, 2026Updated last week
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.β199Jul 22, 2024Updated last year
- A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.β1,430Sep 22, 2025Updated 4 months ago
- Ingest, parse, and optimize any data format β‘οΈ from documents to multimedia β‘οΈ for enhanced compatibility with GenAI frameworksβ6,796Dec 12, 2025Updated 2 months ago
- A toolkit to create optimal Production-readyRetrieval Augmented Generation(RAG) setup for your dataβ1,526May 20, 2025Updated 8 months ago
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate