[ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
☆23Mar 29, 2025Updated last year
Alternatives and similar repositories for LongWriter-V
Users who are interested in LongWriter-V are comparing it to the libraries listed below.
- ☆35Jun 5, 2025Updated last year
- FocusLLM: Scaling LLM’s Context by Parallel Decoding☆45Dec 8, 2024Updated last year
- [NeurIPS 2025] Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆33Oct 20, 2025Updated 7 months ago
- ☆42Jun 18, 2025Updated 11 months ago
- ☆18May 13, 2025Updated last year
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]☆21Aug 21, 2025Updated 9 months ago
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆16Apr 17, 2025Updated last year
- ☆15Apr 11, 2024Updated 2 years ago
- One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning☆40Jul 1, 2023Updated 2 years ago
- Xlore2.0 Code[BaiduExtractor, HudongExtractor, WikiExtractor, XloreData, XloreWeb]☆12Apr 5, 2017Updated 9 years ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆53Dec 13, 2025Updated 6 months ago
- Codes for paper SoAy: A Service-oriented APIs Applying Framework of Large Language Models☆27Jul 14, 2025Updated 11 months ago
- ☆47Jun 24, 2025Updated 11 months ago
- ☆62Oct 29, 2024Updated last year
- Official repository of DialSim☆32Oct 31, 2025Updated 7 months ago
- ☆15Jan 24, 2025Updated last year
- ☆12Apr 25, 2024Updated 2 years ago
- Code for the paper: Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness☆12Oct 22, 2023Updated 2 years ago
- Code for ACL 2022 paper "HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization".☆13May 24, 2022Updated 4 years ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated last year
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)☆19Jul 19, 2025Updated 10 months ago
- TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25☆93Jun 16, 2025Updated 11 months ago
- Baselines for all tasks from Long Code Arena benchmarks 🏟️☆38Mar 30, 2025Updated last year
- ☆31Sep 12, 2025Updated 9 months ago
- [CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps☆13Mar 26, 2025Updated last year
- DICE: Detecting In-distribution Data Contamination with LLM's Internal State☆12Sep 21, 2024Updated last year
- MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision☆29May 26, 2025Updated last year
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆43Jan 5, 2026Updated 5 months ago
- SFT+RL boosts multimodal reasoning☆50Jun 27, 2025Updated 11 months ago
- ☆36Oct 4, 2025Updated 8 months ago
- ☆24Mar 7, 2025Updated last year
- The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".☆18Apr 25, 2025Updated last year
- ☆10Jan 28, 2024Updated 2 years ago
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents☆111May 3, 2026Updated last month
- A comprehensive and efficient long-context model evaluation framework☆31Feb 25, 2026Updated 3 months ago
- ☆13Nov 7, 2023Updated 2 years ago
- Completing the Puzzle of All-in-One Event Understanding Benchmark with Event Arguments☆14Mar 12, 2024Updated 2 years ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆22Oct 10, 2024Updated last year
- Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion☆12Jan 14, 2026Updated 5 months ago