facebookresearch / RAMLinks
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆259Updated last month
Alternatives and similar repositories for RAM
Users that are interested in RAM are comparing it to the libraries listed below
Sorting:
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆226Updated 2 months ago
- Tina: Tiny Reasoning Models via LoRA☆268Updated last month
- ☆97Updated 9 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆130Updated this week
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆404Updated last week
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆114Updated 2 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆341Updated 7 months ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆177Updated 3 weeks ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆412Updated 2 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆63Updated 3 months ago
- Code for the paper: "Learning to Reason without External Rewards"☆319Updated last week
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆166Updated last week
- General Reasoner: Advancing LLM Reasoning Across All Domains☆149Updated last month
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆206Updated last month
- Complex Function Calling Benchmark.☆118Updated 5 months ago
- PyTorch building blocks for the OLMo ecosystem☆261Updated this week
- This is the official repository for Inheritune.☆112Updated 5 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆137Updated last week
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆223Updated 7 months ago
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”☆323Updated this week
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆150Updated last month
- ☆117Updated 4 months ago
- Reproducible, flexible LLM evaluations☆219Updated this week
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆205Updated last year
- The official repo for "LLoCo: Learning Long Contexts Offline"☆117Updated last year
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆187Updated 3 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 6 months ago
- ☆304Updated last month
- RL Scaling and Test-Time Scaling (ICML'25)☆109Updated 5 months ago
- Efficient Agent Training for Computer Use☆114Updated last month