shawnricecake / HeimaView external linksLinks
Code for Heima
☆59Apr 21, 2025Updated 9 months ago
Alternatives and similar repositories for Heima
Users that are interested in Heima are comparing it to the libraries listed below
Sorting:
- [ICLR 2026] FastCar☆16May 22, 2025Updated 8 months ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆24Jul 21, 2025Updated 6 months ago
- Official implementation for Text Generation Beyond Discrete Token Sampling☆21Aug 11, 2025Updated 6 months ago
- [NeurIPS 2025 Spotlight] Fast-Slow Thinking GRPO for Large Vision-Language Model Reasoning☆40Jan 20, 2026Updated 3 weeks ago
- [ICCAD 2025] Squant☆15Jul 3, 2025Updated 7 months ago
- This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"☆13Aug 22, 2025Updated 5 months ago
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Finding]"☆15Aug 27, 2025Updated 5 months ago
- [ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs☆43Oct 14, 2025Updated 4 months ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆78May 30, 2025Updated 8 months ago
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- Official code repo for NeurIPS 2025 Spotlight paper, "Debate or Vote: Which Yields Better Decisions in Multi-Agent LLMs?"☆48Oct 15, 2025Updated 3 months ago
- All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment☆19Feb 11, 2025Updated last year
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Feb 29, 2024Updated last year
- ☆15Mar 30, 2025Updated 10 months ago
- ☆204Apr 19, 2025Updated 9 months ago
- ☆13Jul 20, 2024Updated last year
- ☆14Jul 15, 2024Updated last year
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆34Jun 12, 2025Updated 8 months ago
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning☆35Oct 26, 2025Updated 3 months ago
- ☆101Dec 22, 2023Updated 2 years ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆53Sep 29, 2025Updated 4 months ago
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆18Oct 7, 2024Updated last year
- ☆19Nov 7, 2022Updated 3 years ago
- Code for "Reasoning to Learn from Latent Thoughts"☆124Mar 28, 2025Updated 10 months ago
- ☆179Dec 5, 2025Updated 2 months ago
- ☆19Oct 9, 2024Updated last year
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Nov 15, 2025Updated 2 months ago
- [Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control☆47Nov 25, 2025Updated 2 months ago
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated last year
- ☆21Dec 6, 2025Updated 2 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆132Apr 12, 2025Updated 10 months ago
- [ICLR2026] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"☆30Feb 4, 2026Updated last week
- Multi-Granularity Language-Guided Multi-Object Tracking☆24Nov 3, 2025Updated 3 months ago
- [ICLR2025] Official code implementation of Video-UTR: Unhackable Temporal Rewarding for Scalable Video MLLMs☆61Feb 27, 2025Updated 11 months ago
- Reinforcing General Reasoning without Verifiers☆97Jun 24, 2025Updated 7 months ago
- ☆28Apr 8, 2025Updated 10 months ago
- Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"☆22Nov 21, 2023Updated 2 years ago
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding☆66Jun 10, 2025Updated 8 months ago
- HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model☆86Jul 17, 2025Updated 6 months ago