zhijie-group / SIFT
SIFT: Grounding LLM Reasoning in Contexts via Stickers
★55 · Updated 4 months ago
Alternatives and similar repositories for SIFT
Users that are interested in SIFT are comparing it to the libraries listed below
- This is a repository for organizing papers, codes, and other resources related to Latent Reasoning. ★86 · Updated this week
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ★74 · Updated 3 months ago
- ★122 · Updated last month
- RL Scaling and Test-Time Scaling (ICML'25) ★108 · Updated 5 months ago
- ★47 · Updated 5 months ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling ★99 · Updated last month
- ★18 · Updated 6 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code" ★63 · Updated 3 months ago
- A repo for open research on building large reasoning models ★68 · Updated this week
- Interpretable Contrastive Monte Carlo Tree Search Reasoning ★49 · Updated 8 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning ★102 · Updated 2 months ago
- ★36 · Updated 2 months ago
- ★80 · Updated 6 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning ★71 · Updated 7 months ago
- ★48 · Updated last month
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning" ★67 · Updated 2 months ago
- ★71 · Updated this week
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning ★85 · Updated 4 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ★64 · Updated last month
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ★76 · Updated 5 months ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners ★82 · Updated last month
- Code for "Reasoning to Learn from Latent Thoughts" ★112 · Updated 3 months ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models" ★53 · Updated 11 months ago
- The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond ★152 · Updated last week
- Code for Heima ★49 · Updated 2 months ago
- ★113 · Updated 4 months ago
- Open-Source LLM Coders with Co-Evolving Reinforcement Learning ★93 · Updated last month
- [ACL 2025] Knowledge Unlearning for Large Language Models ★38 · Updated 2 months ago
- [NeurIPS-2024] Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623 ★86 · Updated 9 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning ★45 · Updated last month