☆206 · Apr 19, 2025 · Updated 11 months ago
Alternatives and similar repositories for Internalize_CoT_Step_by_Step
Users that are interested in Internalize_CoT_Step_by_Step are comparing it to the libraries listed below.
- ☆144 · Nov 11, 2024 · Updated last year
- https://interactivetraining.ai/ ☆17 · Oct 2, 2025 · Updated 5 months ago
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,536 · Aug 12, 2025 · Updated 7 months ago
- ☆74 · Apr 27, 2024 · Updated last year
- ☆26 · Jan 14, 2025 · Updated last year
- Code for Heima ☆59 · Apr 21, 2025 · Updated 11 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆176 · Jan 16, 2025 · Updated last year
- ☆39 · Mar 29, 2024 · Updated last year
- Code for Quiet-STaR ☆741 · Aug 21, 2024 · Updated last year
- ☆16 · Mar 22, 2025 · Updated last year
- Bayesian scaling laws for in-context learning. ☆15 · Mar 12, 2025 · Updated last year
- ☆70 · Jun 18, 2025 · Updated 9 months ago
- ☆15 · Jul 9, 2025 · Updated 8 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM). ☆345 · Updated this week
- [COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs ☆64 · Mar 9, 2026 · Updated 2 weeks ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens ☆18 · Feb 29, 2024 · Updated 2 years ago
- A curated list of awesome Deep Learning theories that shed light on the mysteries of DL ☆10 · Jul 20, 2018 · Updated 7 years ago
- Code for "Reasoning to Learn from Latent Thoughts" ☆126 · Mar 28, 2025 · Updated 11 months ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers ☆76 · Jun 23, 2025 · Updated 9 months ago
- An exploration of LLM steering ☆25 · Jun 15, 2024 · Updated last year
- Simple RL training for reasoning ☆3,841 · Dec 23, 2025 · Updated 3 months ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation ☆15 · Apr 29, 2025 · Updated 10 months ago
- ☆124 · Feb 21, 2025 · Updated last year
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆98 · Nov 17, 2024 · Updated last year
- Exploring the Limitations of Large Language Models on Multi-Hop Queries ☆33 · Mar 2, 2025 · Updated last year
- ☆15 · Feb 21, 2024 · Updated 2 years ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆267 · Jul 8, 2025 · Updated 8 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3 ☆10 · Sep 14, 2025 · Updated 6 months ago
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003) ☆23 · Oct 2, 2025 · Updated 5 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space. ☆303 · Updated this week
- Collections of RLxLM experiments using minimal codes ☆14 · Feb 17, 2025 · Updated last year
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts ☆17 · Mar 11, 2025 · Updated last year
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of… ☆83 · May 30, 2025 · Updated 9 months ago
- Official Repository of LatentSeek ☆78 · Jun 6, 2025 · Updated 9 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆345 · Nov 10, 2025 · Updated 4 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline" ☆118 · Jun 15, 2024 · Updated last year
- ☆146 · Sep 12, 2025 · Updated 6 months ago
- ☆49 · Apr 11, 2025 · Updated 11 months ago
- Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024 ☆18 · Mar 25, 2025 · Updated 11 months ago