[CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding
☆36Jul 22, 2025Updated 8 months ago
Alternatives and similar repositories for Docopilot
Users that are interested in Docopilot are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated last year
- ☆24Nov 17, 2025Updated 4 months ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆91Nov 15, 2024Updated last year
- ☆11Oct 31, 2024Updated last year
- Video Benchmark Suite: Rapid Evaluation of Video Foundation Models☆16Jan 10, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆14Jan 26, 2025Updated last year
- Code and Data for "FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation" (ACL25)☆30Oct 26, 2025Updated 5 months ago
- PyTorch implementation of the article "Generative Adversarial Network for Handwritten Text"☆10Nov 13, 2023Updated 2 years ago
- This repository is the codebase of TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy☆51Oct 16, 2024Updated last year
- The official repository of MM-R5☆28Jun 22, 2025Updated 9 months ago
- [ECCV'24] Official Implementation of Autoregressive Visual Entity Recognizer.☆14Mar 2, 2024Updated 2 years ago
- [一个聊天软件Demo] a chat software powered by libevent/mysql and qt☆10Sep 10, 2021Updated 4 years ago
- A High-Quality Diabetic Retinopathy Pixel-Level Annotation Dataset☆15Dec 9, 2025Updated 4 months ago
- Large-scale text embedding model☆39Sep 6, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- serverless vscode webide☆17Apr 14, 2023Updated 2 years ago
- Local DeepSearch (Advantage: Low Threshold): an implementation of Agentic RAG based on DeepSeek-R1 API and Tavily API☆17Jun 21, 2025Updated 9 months ago
- ☆19May 19, 2024Updated last year
- 同济大学简历模版,做了一点点本地化修改 (generated from fky2015/resume-ng)☆15Dec 3, 2023Updated 2 years ago
- ☆67May 19, 2025Updated 10 months ago
- ☆22Sep 16, 2025Updated 6 months ago
- Cross-modal Reinforced Prompting for Graph and Language Tasks, KDD 2024.☆11Sep 29, 2024Updated last year
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆19Jun 19, 2025Updated 9 months ago
- Targeted synthesis of multi-temporal remote sensing images for change detection using siamese neural networks☆24Feb 15, 2019Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆16Oct 6, 2024Updated last year
- SimKO: Simple Pass@K Policy Optimization☆28Oct 24, 2025Updated 5 months ago
- Just prepare config file and start training your metric learning model with ease☆16Apr 2, 2024Updated 2 years ago
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"☆15Aug 26, 2025Updated 7 months ago
- FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language Models☆12Dec 21, 2025Updated 3 months ago
- [🎖️1등(장관상) 솔루션] 2022 국립국어원 인공 지능 언어 능력 평가 (쇼핑몰 리뷰 데이터 속성 기반 감성 분석 : Aspect-Based Sentiment Analysis)☆11Jun 6, 2023Updated 2 years ago
- 操作系统第三次课程项目,一个简单的文件系统☆12Jun 24, 2021Updated 4 years ago
- ☆12Aug 10, 2022Updated 3 years ago
- ☆18Mar 19, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization☆17May 29, 2025Updated 10 months ago
- Code for the MTEB leaderboard☆30Feb 4, 2025Updated last year
- ☆38Jan 9, 2026Updated 3 months ago
- ☆23Apr 23, 2019Updated 6 years ago
- ICCV 2025: Official Implematation of "Aligning Vision to Language: Annotation-Free Multimodal Knowledge Graph Construction for Enhanced L…☆71Oct 25, 2025Updated 5 months ago
- [CVPR 2025 Highlight] Official repository for CoMM Dataset☆52Dec 31, 2024Updated last year
- 糖尿病眼底病变分割和分类☆17Jun 12, 2023Updated 2 years ago