Softlandia-Ltd / vision-is-all-you-needView external linksLinks
Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo
☆403Jun 26, 2025Updated 7 months ago
Alternatives and similar repositories for vision-is-all-you-need
Users that are interested in vision-is-all-you-need are comparing it to the libraries listed below
Sorting:
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆493Jul 23, 2025Updated 6 months ago
- ☆38Nov 21, 2024Updated last year
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search.☆801Updated this week
- AI reads books: Page-by-Page PDF Knowledge Extractor & Summarizer. script performs an intelligent page-by-page analysis of PDF books, met…☆1,574Jan 20, 2025Updated last year
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,976Dec 8, 2025Updated 2 months ago
- ☆35Apr 30, 2025Updated 9 months ago
- Own your AI, search the web with it🌐😎☆94Jan 14, 2025Updated last year
- Use OpenAI's realtime API for a chatting with your documents☆330Oct 6, 2024Updated last year
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,938Sep 24, 2025Updated 4 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆565Nov 20, 2025Updated 2 months ago
- An AI Agent for Personal Self-Reflection☆63Feb 7, 2025Updated last year
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,483Aug 27, 2025Updated 5 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,275Feb 21, 2025Updated 11 months ago
- AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation☆4,573Dec 23, 2025Updated last month
- ☆114Nov 25, 2024Updated last year
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations☆7,428Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,451Apr 30, 2025Updated 9 months ago
- 📃 A better UX for chat, writing content, and coding with LLMs.☆5,359Dec 31, 2025Updated last month
- 🦾 Take control of your AI agents☆1,387Aug 22, 2025Updated 5 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆842Jan 28, 2025Updated last year
- A system for agentic LLM-powered data processing and ETL☆3,557Feb 2, 2026Updated 2 weeks ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆933Jan 30, 2025Updated last year
- Parsing-free RAG supported by VLMs☆910Dec 7, 2025Updated 2 months ago
- Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VL…☆626Jul 26, 2025Updated 6 months ago
- Implementing the 4 agentic patterns from scratch☆1,680Mar 18, 2025Updated 10 months ago
- AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI☆495Dec 25, 2024Updated last year
- ☆26Jul 8, 2025Updated 7 months ago
- ☆451Sep 2, 2025Updated 5 months ago
- ☆101Feb 4, 2025Updated last year
- OpenSource Production ready Customer service with built in Evals and monitoring☆1,435Jan 12, 2026Updated last month
- Leveraging revolutionary Agent and Phi-2 technology, Graph Detective uncovers concealed linkages and discerns patterns, enabling pinpoint…☆10Apr 21, 2024Updated last year
- ☆13Sep 4, 2024Updated last year
- The Open Source Memory Layer For Autonomous Agents☆2,564Oct 22, 2024Updated last year
- The easiest way to get started with LlamaIndex☆1,479Jul 16, 2025Updated 7 months ago
- Knowledge Agents and Management in the Cloud☆4,231Updated this week
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆5,122Updated this week
- Enterprise-grade memory framework for LLMs featuring GPU-optimized inference, vector storage, and automated scaling. Enables hyper-person…☆90May 3, 2025Updated 9 months ago
- Structured information extraction from documents☆318Sep 26, 2024Updated last year
- An open-source RAG-based tool for chatting with your documents.☆25,095Jul 4, 2025Updated 7 months ago