The code used to train and run inference with MMDocIR
☆32May 29, 2025Updated 9 months ago
Alternatives and similar repositories for MMDocIR
Users that are interested in MMDocIR are comparing it to the libraries listed below
Sorting:
- Implementation of "Decoding-time Realignment of Language Models", ICML 2024.☆21Jun 17, 2024Updated last year
- Can VLMs understand students' hand-drawn math work?☆17Jan 20, 2026Updated 2 months ago
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"☆15Aug 26, 2025Updated 6 months ago
- Official implement of ACL'25 Findings paper "MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Lang…☆20Jun 17, 2025Updated 9 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- ☆19Oct 2, 2023Updated 2 years ago
- ☆12Jun 12, 2024Updated last year
- [WWW24-UrbanCLIP] A comprehensive toolkit designed to facilitate the collection, processing, and integration of satellite imagery and ass…☆17Oct 6, 2024Updated last year
- Fast Memorization of Prompt Improves Context Awareness of Large Language Models (Findings of EMNLP 2024)☆23Oct 22, 2024Updated last year
- This is the code for reproducing the TABBIE baseline in our paper: "Retrieval-Based Transformer for Table Augmentation"☆12Sep 17, 2025Updated 6 months ago
- [CVPR26] Official code for GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristic☆66Feb 21, 2026Updated 3 weeks ago
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆38Aug 4, 2025Updated 7 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆73Dec 8, 2025Updated 3 months ago
- ☆31Jul 4, 2024Updated last year
- Open Source Virtual Assistant Framework☆13Sep 4, 2025Updated 6 months ago
- Self-Service Semantic Suite (S4)☆18Sep 29, 2016Updated 9 years ago
- Orchestration middleware for Home Assistant + Ollama: enables 8-20B models to handle complex multi-intent commands through intelligent ta…☆24Feb 6, 2026Updated last month
- The official repo for the DanQing dataset.☆31Jan 16, 2026Updated 2 months ago
- ☆21Jul 18, 2024Updated last year
- Quality Shapes Extraction from very large Knowledge Graphs☆12Nov 15, 2025Updated 4 months ago
- Code for our TVCG paper "DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera".☆19Aug 22, 2025Updated 6 months ago
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback☆12Jul 13, 2022Updated 3 years ago
- PostgreSQL extension for vector search, embeddings, and ML, plus NeuronAgent runtime and NeuronMCP server.☆43Mar 3, 2026Updated 2 weeks ago
- Personalized knowledge graph summarization based on historical queries☆14Jun 17, 2020Updated 5 years ago
- ☆15Sep 23, 2024Updated last year
- ☆12Oct 17, 2022Updated 3 years ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆19Nov 22, 2025Updated 3 months ago
- ☆10Sep 27, 2021Updated 4 years ago
- Code for Findings of ACL 2021 paper "Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain …☆19Dec 16, 2022Updated 3 years ago
- This is an implementation of the POI recommendation model-PPR.☆10Apr 19, 2023Updated 2 years ago
- Code and data for the paper: IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Large Language Models …☆11Apr 27, 2024Updated last year
- The "VIVO-ISF Ontology" is an OWL2 representation of the VIVO-ISF Data Standard☆18Mar 13, 2019Updated 7 years ago
- ☆20Updated this week
- Python implementation of the supervised graph prediction method proposed in http://arxiv.org/abs/2202.03813 using PyTorch library and POT…