facebookresearch / RAMLinks

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

☆297

Alternatives and similar repositories for RAM

Users that are interested in RAM are comparing it to the libraries listed below

Sorting:

shangshang-wang / Tina
Tina: Tiny Reasoning Models via LoRA
☆305Updated last month
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆356Updated 11 months ago
facebookresearch / sweet_rl
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
☆250Updated 6 months ago
ZihanWang314 / CoE
Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models
☆223Updated 2 weeks ago
ScalingIntelligence / large_language_monkeys
☆108Updated last year
lucidrains / coconut-pytorch
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
☆180Updated 5 months ago
facebookresearch / meta-agents-research-environments
Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike stat…
☆364Updated this week
Zhiyuan-Zeng / RLVE
[Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
☆134Updated this week
SalesforceAIResearch / LaTRO
☆124Updated 8 months ago
ypwang61 / One-Shot-RLVR
[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example
☆376Updated last month
GAIR-NLP / OctoThinker
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆180Updated 3 months ago
efficientscaling / Z1
[EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"
☆66Updated 7 months ago
vsubramaniam851 / multiagent-ft
☆222Updated 8 months ago
TIGER-AI-Lab / General-Reasoner
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆199Updated 3 weeks ago
facebookresearch / PhysicsLM4
Physics of Language Models, Part 4
☆260Updated 3 months ago
RulinShao / retrieval-scaling
Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".
☆218Updated 2 weeks ago
TIGER-AI-Lab / CritiqueFineTuning
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]
☆178Updated 4 months ago
jeffreysijuntan / lloco
The official repo for "LLoCo: Learning Long Contexts Offline"
☆118Updated last year
eddycmu / demystify-long-cot
☆326Updated 5 months ago
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆115Updated 9 months ago
THUDM / T1
RL Scaling and Test-Time Scaling (ICML'25)
☆112Updated 9 months ago
ReasoningTransfer / Transferability-of-LLM-Reasoning
☆104Updated last month
QwenLM / ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
☆451Updated 6 months ago
sunblaze-ucb / Intuitor
Code for the paper: "Learning to Reason without External Rewards"
☆373Updated 4 months ago
MLE-Dojo / MLE-Dojo
☆78Updated 3 weeks ago
zwhe99 / DeepMath
A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
☆269Updated last month
hughbzhang / o1_inference_scaling_laws
Replicating O1 inference-time scaling laws
☆90Updated 11 months ago
MiniMax-AI / SynLogic
[NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond
☆186Updated 4 months ago
agentica-project / verl-pipeline
Async pipelined version of Verl
☆125Updated 7 months ago
sail-sg / Precision-RL
Defeating the Training-Inference Mismatch via FP16
☆149Updated last week