multimodal-art-projection / P2PLinks
P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark
☆37Updated 2 months ago
Alternatives and similar repositories for P2P
Users that are interested in P2P are comparing it to the libraries listed below
Sorting:
- Awesome-Large-Search-Models is a collection of papers and resources (Methods, Datasets and other resources) about awesome agentic search …☆113Updated last month
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆225Updated 2 weeks ago
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆300Updated last month
- [ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation☆65Updated last year
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆82Updated 6 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆90Updated last year
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆284Updated this week
- Extrapolating RLVR to General Domains without Verifiers☆136Updated 2 weeks ago
- [ACL 2024] ChartAssistant is a chart-based vision-language model for universal chart comprehension and reasoning.☆123Updated 11 months ago
- An Easy-to-use Hallucination Detection Framework for LLMs.☆60Updated last year
- ☆67Updated last month
- ☆91Updated last month
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆127Updated 3 months ago
- ☆159Updated 3 months ago
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆52Updated 2 months ago
- ☆50Updated 5 months ago
- Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆54Updated last week
- Repository for the NeurIPS 2024 paper "SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up…☆25Updated 8 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆81Updated 2 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆152Updated last month
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆62Updated 3 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆53Updated 2 months ago
- ☆176Updated last month
- Awesome Agent Training☆208Updated this week
- MiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.☆90Updated this week
- This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning".☆22Updated 2 months ago
- A Self-Training Framework for Vision-Language Reasoning☆80Updated 6 months ago
- MMR1: Advancing the Frontiers of Multimodal Reasoning☆162Updated 4 months ago
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆77Updated 3 weeks ago
- The demo, code and data of FollowRAG☆74Updated last month