snumprlab / himaLinks
Official Implementation of HIMA (COLM'25)
☆19Updated 2 months ago
Alternatives and similar repositories for hima
Users that are interested in hima are comparing it to the libraries listed below
Sorting:
- A Text2SQL benchmark for evaluation of Large Language Models☆42Updated last week
- ☆17Updated last year
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Updated this week
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆23Updated last month
- [NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"☆30Updated 3 weeks ago
- ☆14Updated last year
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆26Updated 3 months ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆49Updated last week
- ☆24Updated 5 months ago
- ☆23Updated 6 months ago
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆26Updated 3 months ago
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models☆20Updated last year
- [NeurIPS 2025] Official Implementation for "Enhancing Vision-Language Model Reliability with Uncertainty-Guided Dropout Decoding"☆22Updated last year
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆60Updated 3 months ago
- More reliable Video Understanding Evaluation☆14Updated 4 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆17Updated 3 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Updated last year
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆34Updated last year
- Retrieval-Augmented Decision Transformer: External Memory for In-context RL☆24Updated last year
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆34Updated 3 weeks ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆45Updated last month
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆19Updated 8 months ago
- ☆51Updated 9 months ago
- ☆33Updated 6 months ago
- ☆24Updated 8 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆53Updated 4 months ago
- Cost-Sensitive Toolpath Agent for Multi-turn Image Editing☆25Updated 10 months ago
- ☆14Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Updated 4 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Updated last month