da03 / Internalize_CoT_Step_by_Step
☆135Updated 3 months ago
Alternatives and similar repositories for Internalize_CoT_Step_by_Step:
Users that are interested in Internalize_CoT_Step_by_Step are comparing it to the libraries listed below
- ☆113Updated 2 months ago
- ☆93Updated 6 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆206Updated 2 months ago
- ☆115Updated 3 months ago
- ☆89Updated this week
- For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.☆82Updated this week
- Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".☆153Updated 3 months ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆145Updated last month
- A simple unified framework for evaluating LLMs☆164Updated 3 weeks ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆157Updated this week
- ☆58Updated 8 months ago
- LOFT: A 1 Million+ Token Long-Context Benchmark☆164Updated 2 months ago
- Evaluating LLMs with fewer examples☆141Updated 9 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆175Updated last month
- Function Vectors in Large Language Models (ICLR 2024)☆131Updated 3 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning"☆97Updated 6 months ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆130Updated 3 months ago
- Language models scale reliably with over-training and on downstream tasks☆96Updated 9 months ago
- This is the official repository for Inheritune.☆108Updated 3 months ago
- Reproducible, flexible LLM evaluations☆118Updated last month
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆129Updated 2 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆204Updated 7 months ago
- ☆135Updated this week
- Functional Benchmarks and the Reasoning Gap☆82Updated 3 months ago
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆170Updated 5 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆66Updated last year
- Replicating O1 inference-time scaling laws☆70Updated last month
- ☆50Updated 2 months ago
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)☆48Updated 5 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆154Updated 3 months ago