TianheL / LM-Implicit-Reasoning
[Arxiv] Implicit Reasoning in Transformers is Reasoning through Shortcuts
☆14Updated 2 months ago
Alternatives and similar repositories for LM-Implicit-Reasoning
Users that are interested in LM-Implicit-Reasoning are comparing it to the libraries listed below
Sorting:
- The official implementation of Preference Data Reward-Augmentation.☆17Updated 2 weeks ago
- Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]☆45Updated 3 months ago
- ☆14Updated 4 months ago
- [arXiv 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆11Updated last month
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 6 months ago
- The official implementation of Cross-Task Experience Sharing (COPS)☆22Updated 6 months ago
- ☆45Updated 3 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 4 months ago
- ☆108Updated last week
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆35Updated 7 months ago
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆36Updated 9 months ago
- ☆24Updated last month
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated 2 months ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging (ICML 2025)☆20Updated 2 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆22Updated 2 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆13Updated last month
- This is the repository for NAACL'25 paper "TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning"☆53Updated 2 weeks ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆83Updated last week
- ☆63Updated last week
- The code of arXiv paper: "Dynamic Scaling of Unit Tests for Code Reward Modeling"☆19Updated 4 months ago
- ☆17Updated 4 months ago
- FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models☆43Updated last month
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆31Updated last year
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆15Updated last month
- LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆17Updated last month
- ☆18Updated 2 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆40Updated last week
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆43Updated 2 months ago
- [Preprint] A Generalizable and Purely Unsupervised Self-Training Framework☆57Updated last week
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆69Updated last month