ekinakyurek / marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
☆289 Updated 2 months ago
Alternatives and similar repositories for marc:
Users who are interested in marc are comparing it to the libraries listed below:
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym ☆325 Updated last month
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆294 Updated 2 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆160 Updated this week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc. ☆182 Updated last week
- Bootstrapping ARC ☆100 Updated 2 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆137 Updated last week
- ☆98 Updated last month
- Draw more samples ☆186 Updated 7 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆167 Updated last month
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization' ☆182 Updated 2 months ago
- ☆146 Updated last week
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training ☆258 Updated 8 months ago
- ☆106 Updated 3 weeks ago
- ☆171 Updated last year
- ☆158 Updated last month
- Some preliminary explorations of Mamba's context scaling. ☆213 Updated last year
- A simple unified framework for evaluating LLMs ☆195 Updated last week
- Code for the paper 🌳 Tree Search for Language Model Agents ☆175 Updated 6 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆183 Updated 8 months ago
- Automatic Evals for LLMs ☆201 Updated this week
- Understand and test language model architectures on synthetic tasks. ☆181 Updated last month
- [NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models ☆194 Updated 2 weeks ago
- Can Language Models Solve Olympiad Programming? ☆110 Updated last month
- Sparsify transformers with SAEs and transcoders ☆458 Updated this week
- Code and example data for the paper: Rule Based Rewards for Language Model Safety ☆178 Updated 6 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment" ☆120 Updated 3 months ago
- AWM: Agent Workflow Memory ☆239 Updated 2 weeks ago
- ☆265 Updated 7 months ago