MaybeLizzy / PERMULinks
☆32Updated 3 months ago
Alternatives and similar repositories for PERMU
Users that are interested in PERMU are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆64Updated 4 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆89Updated 11 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆38Updated 7 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Updated 9 months ago
- Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".☆35Updated 6 months ago
- Paper List of Inference/Test Time Scaling/Computing☆346Updated 5 months ago
- (ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs"☆73Updated 4 months ago
- VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues☆44Updated 8 months ago
- (ICLR 2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆47Updated 7 months ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆39Updated 6 months ago
- [NeurIPS'25] HoliTom: Holistic Token Merging for Fast Video Large Language Models☆70Updated 3 months ago
- A Collection of Papers on Diffusion Language Models☆154Updated 4 months ago
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆248Updated 3 months ago
- [NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method☆93Updated 2 months ago
- Code for CVPR 2024 Oral "Neural Lineage"☆17Updated last year
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆104Updated 4 months ago
- MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency☆137Updated 5 months ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆24Updated 10 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆85Updated 7 months ago
- Official Repository of LatentSeek☆76Updated 7 months ago
- [arXiv:2508.00410] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"☆30Updated 3 months ago
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆43Updated 4 months ago
- ☆56Updated last year
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"☆110Updated last month
- ☆204Updated last month
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆60Updated last year
- Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".☆24Updated 3 months ago
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆114Updated 6 months ago
- Data distillation benchmark☆71Updated 7 months ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆76Updated 8 months ago