OSU-NLP-Group / GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
☆175Updated last month
Alternatives and similar repositories for GrokkedTransformer:
Users that are interested in GrokkedTransformer are comparing it to the libraries listed below
- ☆135Updated 3 months ago
- ☆93Updated 6 months ago
- Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".☆153Updated 3 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆134Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆154Updated 2 months ago
- A simple unified framework for evaluating LLMs☆164Updated 3 weeks ago
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.☆178Updated last month
- ☆89Updated this week
- Repository for the paper Stream of Search: Learning to Search in Language☆118Updated 5 months ago
- Functional Benchmarks and the Reasoning Gap☆82Updated 3 months ago
- ☆115Updated 3 months ago
- The official evaluation suite and dynamic data release for MixEval.☆233Updated 2 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆154Updated 3 months ago
- ☆135Updated this week
- Benchmarking LLMs with Challenging Tasks from Real Users☆206Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆157Updated this week
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆204Updated 7 months ago
- For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.☆82Updated this week
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)☆48Updated 5 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆120Updated last month
- A toolkit for describing model features and intervening on those features to steer behavior.☆149Updated 2 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆78Updated last month
- LOFT: A 1 Million+ Token Long-Context Benchmark☆164Updated 2 months ago
- ☆115Updated this week
- Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024☆113Updated 2 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆278Updated last month
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆153Updated last month
- This is the official repository for Inheritune.☆108Updated 3 months ago
- Code for the paper 🌳 Tree Search for Language Model Agents☆163Updated 5 months ago
- ☆119Updated last month