snumprlab / himaLinks
Official Implementation of HIMA (COLM'25)
☆19Updated 2 months ago
Alternatives and similar repositories for hima
Users that are interested in hima are comparing it to the libraries listed below
Sorting:
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆26Updated 2 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆42Updated 3 weeks ago
- ☆17Updated last year
- ☆19Updated 7 months ago
- ☆14Updated last year
- Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆47Updated 2 months ago
- The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Updated 4 months ago
- [NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"☆30Updated 2 weeks ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Updated 4 months ago
- More reliable Video Understanding Evaluation☆13Updated 4 months ago
- Retrieval-Augmented Decision Transformer: External Memory for In-context RL☆24Updated last year
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆23Updated last month
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Updated last year
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆57Updated 3 months ago
- ☆14Updated last year
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆17Updated 3 months ago
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆19Updated 7 months ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆26Updated 3 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆23Updated last week
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆36Updated last month
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆33Updated last year
- DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning☆166Updated 2 months ago
- Preference Learning for LLaVA☆58Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆17Updated 10 months ago
- ☆24Updated 5 months ago
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models☆20Updated last year
- \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆19Updated 11 months ago
- ☆23Updated 8 months ago
- MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆22Updated 4 months ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…☆47Updated last year