snumprlab / himaLinks
Official Implementation of HIMA (COLM'25)
☆19Updated 2 months ago
Alternatives and similar repositories for hima
Users that are interested in hima are comparing it to the libraries listed below
Sorting:
- [NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"☆30Updated 3 weeks ago
- ☆17Updated last year
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Updated this week
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆49Updated last week
- ☁ ️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆19Updated 8 months ago
- ☆24Updated 5 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆42Updated last week
- ☆24Updated 8 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Updated 4 months ago
- [NAACL 2024] Vision language model that reduces hallucinations through self-feedback guided revision. Visualizes attentions on image feat…☆47Updated last year
- More reliable Video Understanding Evaluation☆14Updated 4 months ago
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆34Updated last year
- Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)☆19Updated 7 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆60Updated 3 months ago
- Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images☆18Updated 8 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Updated 5 months ago
- ☆19Updated 8 months ago
- Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks☆36Updated 2 months ago
- ☆51Updated 9 months ago
- ☆18Updated 3 months ago
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆26Updated 3 months ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆26Updated 3 months ago
- ☆14Updated last year
- MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆22Updated 4 months ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆45Updated last month
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆23Updated last month
- ☆21Updated 7 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning☆28Updated 2 weeks ago
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models☆20Updated last year
- ☆21Updated last year