da03 / Internalize_CoT_Step_by_Step
☆170Updated 2 weeks ago
Alternatives and similar repositories for Internalize_CoT_Step_by_Step:
Users that are interested in Internalize_CoT_Step_by_Step are comparing it to the libraries listed below
- ☆127Updated 5 months ago
- Function Vectors in Large Language Models (ICLR 2024)☆163Updated 2 weeks ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆189Updated 5 months ago
- For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.☆113Updated this week
- ☆163Updated last month
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆177Updated last month
- [NeurIPS'24 Spotlight] Observational Scaling Laws☆54Updated 7 months ago
- The HELMET Benchmark☆142Updated 2 weeks ago
- ☆60Updated last year
- ☆97Updated 10 months ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆189Updated 9 months ago
- Replicating O1 inference-time scaling laws☆84Updated 5 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 3 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆141Updated 2 weeks ago
- ☆72Updated 5 months ago
- ☆111Updated 5 months ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆165Updated 4 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆133Updated 7 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 3 months ago
- ☆114Updated 2 months ago
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)☆53Updated 8 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆93Updated last month
- ☆150Updated 4 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]☆105Updated 2 months ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆102Updated 7 months ago
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study☆49Updated 5 months ago
- Repo of paper "Free Process Rewards without Process Labels"☆145Updated last month
- Benchmarking LLMs with Challenging Tasks from Real Users☆221Updated 6 months ago
- The official repository of the Omni-MATH benchmark.☆83Updated 4 months ago
- Self-Alignment with Principle-Following Reward Models☆160Updated last year