shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆39 Updated last month
Alternatives and similar repositories for Resa
Users who are interested in Resa are comparing it to the repositories listed below.
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” ☆18 Updated last month
- ☆23 Updated 3 weeks ago
- Official Repo for RuleReasoner. ☆24 Updated last month
- Official implementation of ECCV24 paper: POA ☆24 Updated 11 months ago
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508) ☆35 Updated this week
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆19 Updated 3 weeks ago
- Lottery Ticket Adaptation ☆39 Updated 7 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family. ☆29 Updated 3 months ago
- Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning" ☆18 Updated last month
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆19 Updated 4 months ago
- ☆16 Updated 11 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆33 Updated 3 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆46 Updated 4 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆15 Updated 4 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM" ☆15 Updated this week
- ☆47 Updated 5 months ago
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing" ☆15 Updated 3 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025 ☆27 Updated 2 months ago
- ☆33 Updated 2 weeks ago
- ☆48 Updated last month
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity". ☆23 Updated 8 months ago
- ☆33 Updated 6 months ago
- Fork of Flame repo for training of some new stuff in development ☆14 Updated 3 weeks ago
- ☆19 Updated 4 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning? ☆25 Updated 3 months ago
- Open-Source LLM Coders with Co-Evolving Reinforcement Learning ☆93 Updated last month
- ☆18 Updated 6 months ago
- implementation of dualformer ☆17 Updated 4 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆63 Updated last month
- ☆71 Updated this week