Gzy1112 / MMRAG-DocQALinks
☆16Updated last month
Alternatives and similar repositories for MMRAG-DocQA
Users that are interested in MMRAG-DocQA are comparing it to the libraries listed below
Sorting:
- ☆28Updated 11 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 7 months ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆39Updated last month
- ☆28Updated 11 months ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Updated last year
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated last year
- The code in "SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design"☆24Updated 3 months ago
- Here is a demo for PDF parser (Including OCR, object detection tools)☆36Updated 11 months ago
- ☆39Updated 5 months ago
- Search, organize, discover anything!☆48Updated last year
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆57Updated 10 months ago
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆70Updated last year
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆145Updated 3 months ago
- 视觉信息抽取任务中,使用OCR识别结果规范多模态大模型的回答☆40Updated 8 months ago
- [ACL24] Official repo for "Synthesizing Text-to-SQL Data from Weak and Strong LLMs"☆67Updated last year
- ☆48Updated 9 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆85Updated 8 months ago
- ☆95Updated 9 months ago
- DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking☆47Updated 6 months ago
- ☆13Updated last year
- Our 2nd-gen LMM☆34Updated last year
- 从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct,Reyes也通过一个两层MLP投影层连…☆26Updated 7 months ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆223Updated last month
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆43Updated 5 months ago
- ☆111Updated this week
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated last year
- An interactive thinking and deep reasoning model. It provides a cognitive reasoning paradigm for complex multi-hop problems.☆62Updated 2 months ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆126Updated 10 months ago
- A Toolkit for Table-based Question Answering☆112Updated last year
- TianGong-AI-Unstructure☆69Updated this week