shawnricecake / Heima

Code for Heima

☆49

Alternatives and similar repositories for Heima

Users who are interested in Heima are comparing it to the repositories listed below.

luka-group / vlm-knowledge-conflict
Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."
☆42 Updated 8 months ago
hkust-nlp / mstar
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆61 Updated 6 months ago
GATECH-EIC / ACT
[ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…
☆40 Updated last year
xuyige / SoftCoT
ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…
☆31 Updated last month
shiqichen17 / VLM_Merging
GitHub repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)
☆63 Updated last month
SihengLi99 / SEALONG
Large Language Models Can Self-Improve in Long-context Reasoning
☆71 Updated 7 months ago
UCSB-NLP-Chang / ThinkPrune
☆36 Updated 3 months ago
NUS-TRAIL / NoisyRollout
NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
☆78 Updated last month
MingLiiii / Layer_Gradient
[ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
☆70 Updated 3 weeks ago
ShadeCloak / ADORA
☆46 Updated 3 months ago
yule-BUAA / MergeLLM
Codes for Merging Large Language Models
☆32 Updated 11 months ago
RM-R1-UIUC / RM-R1
RM-R1: Unleashing the Reasoning Potential of Reward Models
☆113 Updated 3 weeks ago
cxcscmu / Montessori-Instruct
Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]
☆46 Updated 5 months ago
yihedeng9 / STIC
Enhancing Large Vision Language Models with Self-Training on Image Comprehension.
☆68 Updated last year
zhijie-group / SIFT
SIFT: Grounding LLM Reasoning in Contexts via Stickers
☆55 Updated 4 months ago
horseee / CoT-Valve
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆76 Updated 5 months ago
bigai-nlco / LatentSeek
Official Repository of LatentSeek
☆51 Updated last month
TIGER-AI-Lab / VL-Rethinker
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning"
☆124 Updated last month
aeroplanepaper / GRPO-LEAD
☆19 Updated 2 months ago
GaryStack / MMR-V
Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?
☆31 Updated 3 weeks ago
bethgelab / sober-reasoning
A Sober Look at Language Model Reasoning
☆75 Updated last month
LLM360 / Reasoning360
A repo for open research on building large reasoning models
☆68 Updated last week
yunfeixie233 / ViGaL
☆48 Updated last month
ruixin31 / Spurious_Rewards
☆318 Updated last month
njucckevin / MM-Self-Improve
A Self-Training Framework for Vision-Language Reasoning
☆80 Updated 5 months ago
OpenSparseLLMs / LLaMA-MoE-v2
🚀 LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
☆86 Updated 7 months ago
GeniusHTX / TALE
☆122 Updated last month
NineAbyss / S2R
This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"
☆67 Updated 2 months ago
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆96 Updated last week
luka-group / mDPO
[EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.
☆76 Updated 8 months ago