☆205Apr 19, 2025Updated 10 months ago
Alternatives and similar repositories for Internalize_CoT_Step_by_Step
Users that are interested in Internalize_CoT_Step_by_Step are comparing it to the libraries listed below
Sorting:
- ☆141Nov 11, 2024Updated last year
- Training Large Language Model to Reason in a Continuous Latent Space☆1,522Aug 12, 2025Updated 6 months ago
- ☆74Apr 27, 2024Updated last year
- ☆26Jan 14, 2025Updated last year
- Code for Heima☆59Apr 21, 2025Updated 10 months ago
- ☆39Mar 29, 2024Updated last year
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated 11 months ago
- https://interactivetraining.ai/☆17Oct 2, 2025Updated 5 months ago
- ☆70Jun 18, 2025Updated 8 months ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆75Jun 23, 2025Updated 8 months ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆15Apr 29, 2025Updated 10 months ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Feb 29, 2024Updated 2 years ago
- Code for Quiet-STaR☆741Aug 21, 2024Updated last year
- ☆15Jul 9, 2025Updated 7 months ago
- A curated list of awesome Deep Learning theories that shed light on the mysteries of DL☆10Jul 20, 2018Updated 7 years ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆344Dec 16, 2025Updated 2 months ago
- Long Context Extension and Generalization in LLMs☆63Sep 21, 2024Updated last year
- Exploring the Limitations of Large Language Models on Multi-Hop Queries☆32Mar 2, 2025Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆176Jan 16, 2025Updated last year
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Nov 17, 2024Updated last year
- Code for "Reasoning to Learn from Latent Thoughts"☆124Mar 28, 2025Updated 11 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆266Jul 8, 2025Updated 7 months ago
- ☆123Feb 21, 2025Updated last year
- Simple RL training for reasoning☆3,830Dec 23, 2025Updated 2 months ago
- Just a bunch of benchmark logs for different LLMs☆119Jul 28, 2024Updated last year
- ☆145Sep 12, 2025Updated 5 months ago
- [ICML 2025] Predictive Data Selection: The Data That Predicts Is the Data That Teaches☆62Mar 4, 2025Updated 11 months ago
- [AAAI 2025] Augmenting Math Word Problems via Iterative Question Composing (https://arxiv.org/abs/2401.09003)☆23Oct 2, 2025Updated 5 months ago
- ☆28Oct 2, 2025Updated 5 months ago
- ☆15Feb 21, 2024Updated 2 years ago
- A Structured Span Selector (NAACL 2022). A structured span selector with a WCFG for span selection tasks (coreference resolution, semanti…☆21Jul 11, 2022Updated 3 years ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆118Jun 15, 2024Updated last year
- A library for advanced large language model reasoning☆2,333Jun 10, 2025Updated 8 months ago
- ☆342Jun 5, 2025Updated 8 months ago
- Data preparation code for Amber 7B LLM☆93May 10, 2024Updated last year
- A bibliography and survey of the papers surrounding o1☆1,212Nov 16, 2024Updated last year
- https://icml.cc/virtual/2023/poster/24354☆10Aug 15, 2023Updated 2 years ago
- ☆12Jul 8, 2024Updated last year
- ☆20Aug 8, 2025Updated 6 months ago