TianheL / LM-Implicit-ReasoningLinks
[ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts
☆15Updated 4 months ago
Alternatives and similar repositories for LM-Implicit-Reasoning
Users that are interested in LM-Implicit-Reasoning are comparing it to the libraries listed below
Sorting:
- The official implementation of Preference Data Reward-Augmentation.☆17Updated 2 months ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆46Updated 5 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆19Updated 3 weeks ago
- FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning☆52Updated last month
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆64Updated last month
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆13Updated 6 months ago
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆37Updated 11 months ago
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆16Updated 3 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆38Updated 9 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models☆39Updated 2 months ago
- ☆30Updated 3 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆18Updated last month
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆59Updated last year
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆74Updated 9 months ago
- ☆16Updated 3 months ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆22Updated 3 weeks ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆63Updated 3 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆93Updated last week
- ☆20Updated last month
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Updated 4 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆46Updated 2 months ago
- ☆16Updated 6 months ago
- ☆47Updated 5 months ago
- Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs"☆54Updated 8 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆16Updated last week
- [COLING'25] Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?☆79Updated 5 months ago
- ☆48Updated last month
- CS194-196 Course Project☆15Updated 4 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆40Updated last month
- ☆126Updated 2 months ago