THU-KEG / LongWriter-V
[ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
☆23 Updated 10 months ago
Alternatives and similar repositories for LongWriter-V
Users who are interested in LongWriter-V are comparing it to the repositories listed below.
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM" ☆17 Updated 3 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents ☆48 Updated 11 months ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization ☆24 Updated 4 months ago
- Official implementation of ECCV24 paper: POA ☆24 Updated last year
- JudgeLRM: Large Reasoning Models as a Judge ☆41 Updated 2 weeks ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆29 Updated 4 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?" ☆63 Updated last year
- ☆50 Updated 8 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs ☆24 Updated last year
- More reliable Video Understanding Evaluation ☆14 Updated 4 months ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios" ☆28 Updated 7 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex… ☆41 Updated 3 months ago
- ☆32 Updated 2 weeks ago
- ☆16 Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment ☆16 Updated last year
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning ☆34 Updated 5 months ago
- Official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆37 Updated last year
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" [EMNLP25] ☆38 Updated last week
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning? ☆32 Updated 6 months ago
- ☆68 Updated 4 months ago
- ☆19 Updated 11 months ago
- [AAAI 2026] Multimodal Deepresearcher: Generating Text-Chart Interleaved Reports From Scratch with Agentic Framework ☆44 Updated 2 weeks ago
- ☆23 Updated last year
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604 ☆55 Updated 3 months ago
- Long Context Research ☆26 Updated 2 weeks ago
- ☆52 Updated 8 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents ☆38 Updated last week
- 🌟Official code of our AAAI26 paper 🔍WebFilter ☆35 Updated 3 months ago
- ☆47 Updated 4 months ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025). ☆12 Updated 4 months ago