facebookresearch / RAMLinks
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆284Updated 2 weeks ago
Alternatives and similar repositories for RAM
Users that are interested in RAM are comparing it to the libraries listed below
Sorting:
- ☆104Updated 11 months ago
- Tina: Tiny Reasoning Models via LoRA☆282Updated last month
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆244Updated 4 months ago
- ☆92Updated 3 weeks ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆64Updated 5 months ago
- Code for the paper: "Learning to Reason without External Rewards"☆353Updated 2 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆171Updated 2 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆173Updated last month
- ☆214Updated 6 months ago
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example☆357Updated this week
- Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models☆220Updated this week
- General Reasoner: Advancing LLM Reasoning Across All Domains☆171Updated 3 months ago
- ☆315Updated 3 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆111Updated 7 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆160Updated 3 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆116Updated last year
- ☆116Updated 7 months ago
- ☆331Updated last month
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆128Updated last month
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆179Updated 3 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆190Updated 6 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆343Updated 9 months ago
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆170Updated last year
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆206Updated last week
- ☆92Updated last week
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆225Updated last week
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆143Updated 9 months ago
- ☆89Updated 4 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆161Updated 5 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆113Updated this week