OSU-NLP-Group / GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
☆161Updated last month
Related projects ⓘ
Alternatives and complementary repositories for GrokkedTransformer
- Repository for the paper Stream of Search: Learning to Search in Language☆91Updated 3 months ago
- Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".☆123Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆128Updated 3 weeks ago
- ☆112Updated last month
- A simple unified framework for evaluating LLMs☆145Updated last week
- ☆102Updated last month
- ☆90Updated 4 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆195Updated 2 weeks ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆65Updated last month
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆110Updated 3 weeks ago
- This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.☆109Updated this week
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆181Updated this week
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆199Updated 6 months ago
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)☆42Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆160Updated 3 months ago
- Can Language Models Solve Olympiad Programming?☆100Updated 3 months ago
- ☆101Updated 3 months ago
- Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024☆99Updated last week
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆144Updated last month
- ☆107Updated this week
- The official evaluation suite and dynamic data release for MixEval.☆224Updated last week
- A toolkit for describing model features and intervening on those features to steer behavior.☆99Updated last week
- code for training & evaluating Contextual Document Embedding models☆117Updated this week
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆178Updated 5 months ago
- ☆116Updated 5 months ago
- Extract full next-token probabilities via language model APIs☆229Updated 8 months ago
- ☆93Updated last year
- This is the official repository for Inheritune.☆105Updated last month
- Evaluating LLMs with fewer examples☆134Updated 7 months ago