dame-cell / VisionRAG
A new novel multi-modality (Vision) RAG architecture
☆25Updated 6 months ago
Alternatives and similar repositories for VisionRAG:
Users that are interested in VisionRAG are comparing it to the libraries listed below
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆72Updated 3 weeks ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆65Updated 6 months ago
- RAGLight is a lightweight and modular Python library for implementing Retrieval-Augmented Generation (RAG), Agentic RAG and RAT (Retrieva…☆24Updated 3 weeks ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆33Updated 3 months ago
- Universal text classifier for generative models☆23Updated 8 months ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆50Updated 4 months ago
- ☆41Updated 4 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 5 months ago
- official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆141Updated 10 months ago
- [ICLR 2025] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts☆40Updated 4 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- ☆45Updated 6 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 7 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆82Updated 2 months ago
- ☆20Updated last month
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆44Updated last year
- [WWW 2025] A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System.☆61Updated 3 weeks ago
- ☆24Updated 3 months ago
- Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers☆66Updated 10 months ago
- ☆29Updated 7 months ago
- ☆36Updated 2 years ago
- ☆89Updated 3 weeks ago
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"☆105Updated 4 months ago
- ☆56Updated 4 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆139Updated 10 months ago
- Deep Research through Multi-Agents, using GraphRAG☆65Updated 5 months ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆98Updated last year
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆109Updated 2 weeks ago
- ☆57Updated 9 months ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆67Updated last year