Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo
☆405Jun 26, 2025Updated 8 months ago
Alternatives and similar repositories for vision-is-all-you-need
Users that are interested in vision-is-all-you-need are comparing it to the libraries listed below
Sorting:
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆496Jul 23, 2025Updated 7 months ago
- ☆37Nov 21, 2024Updated last year
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search.☆807Feb 20, 2026Updated 2 weeks ago
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆1,578Jan 20, 2025Updated last year
- ☆36Apr 30, 2025Updated 10 months ago
- Own your AI, search the web with it🌐😎☆93Jan 14, 2025Updated last year
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,987Dec 8, 2025Updated 3 months ago
- Use OpenAI's realtime API for a chatting with your documents☆328Oct 6, 2024Updated last year
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆568Nov 20, 2025Updated 3 months ago
- An AI Agent for Personal Self-Reflection☆64Feb 7, 2025Updated last year
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,489Aug 27, 2025Updated 6 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,940Sep 24, 2025Updated 5 months ago
- AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation☆4,601Dec 23, 2025Updated 2 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,343Feb 21, 2025Updated last year
- ☆114Nov 25, 2024Updated last year
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations☆7,493Feb 11, 2026Updated 3 weeks ago
- 🦾 Take control of your AI agents☆1,388Aug 22, 2025Updated 6 months ago
- 📃 A better UX for chat, writing content, and coding with LLMs.☆5,389Feb 25, 2026Updated last week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,455Apr 30, 2025Updated 10 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆845Jan 28, 2025Updated last year
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆933Jan 30, 2025Updated last year
- Parsing-free RAG supported by VLMs☆929Dec 7, 2025Updated 3 months ago
- Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VL…☆627Jul 26, 2025Updated 7 months ago
- Implementing the 4 agentic patterns from scratch☆1,690Mar 18, 2025Updated 11 months ago
- AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI☆494Dec 25, 2024Updated last year
- ☆26Jul 8, 2025Updated 8 months ago
- A system for agentic LLM-powered data processing and ETL☆3,669Feb 2, 2026Updated last month
- ☆101Feb 4, 2025Updated last year
- ☆451Sep 2, 2025Updated 6 months ago
- OpenSource Production ready Customer service with built in Evals and monitoring☆1,437Jan 12, 2026Updated last month
- ☆14Sep 4, 2024Updated last year
- Leveraging revolutionary Agent and Phi-2 technology, Graph Detective uncovers concealed linkages and discerns patterns, enabling pinpoint…☆10Apr 21, 2024Updated last year
- The Open Source Memory Layer For Autonomous Agents☆2,576Oct 22, 2024Updated last year
- The easiest way to get started with LlamaIndex☆1,479Jul 16, 2025Updated 7 months ago
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆5,129Updated this week
- Enterprise-grade memory framework for LLMs featuring GPU-optimized inference, vector storage, and automated scaling. Enables hyper-person…☆91May 3, 2025Updated 10 months ago
- Structured information extraction from documents☆318Sep 26, 2024Updated last year
- Knowledge Agents and Management in the Cloud☆4,240Feb 17, 2026Updated 3 weeks ago
- PraisonAI is a production-ready Multi AI Agents framework, designed to create AI Agents to automate and solve problems ranging from simpl…☆5,638Updated this week