facebookresearch / RAMLinks
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆256Updated last week
Alternatives and similar repositories for RAM
Users that are interested in RAM are comparing it to the libraries listed below
Sorting:
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆219Updated last month
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.☆383Updated 2 weeks ago
- Async pipelined version of Verl☆100Updated 2 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆159Updated 3 weeks ago
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”☆304Updated last week
- Reproducible, flexible LLM evaluations☆214Updated last month
- Code for the paper: "Learning to Reason without External Rewards"☆306Updated last week
- SkyRL-v0: Train Real-World Long-Horizon Agents via Reinforcement Learning☆422Updated last week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆339Updated 6 months ago
- Tina: Tiny Reasoning Models via LoRA☆260Updated 3 weeks ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆188Updated 11 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆222Updated last month
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)☆54Updated 10 months ago
- ☆181Updated 2 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains☆142Updated 2 weeks ago
- ☆300Updated 3 weeks ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆202Updated 3 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆106Updated 5 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆149Updated 2 weeks ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆402Updated last month
- Simple extension on vLLM to help you speed up reasoning model without training.☆161Updated 3 weeks ago
- [NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models☆221Updated last month
- A project to improve skills of large language models☆429Updated this week
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆109Updated this week
- Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆107Updated 2 months ago
- LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.☆231Updated 10 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆61Updated 2 months ago
- ☆125Updated 2 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆184Updated this week
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆163Updated last year