facebookresearch / PhysicsLM4Links

Physics of Language Models, Part 4

☆260

Alternatives and similar repositories for PhysicsLM4

Users that are interested in PhysicsLM4 are comparing it to the libraries listed below

Sorting:

sustcsonglin / linear-attention-and-beyond-slides
☆99Updated 9 months ago
fla-org / flame
🔥 A minimal training framework for scaling FLA models
☆311Updated 2 weeks ago
facebookresearch / iGSM
The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces…
☆80Updated 10 months ago
lucidrains / coconut-pytorch
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
☆180Updated 5 months ago
Parallel-Reasoning / APR
[COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models
☆134Updated 3 months ago
HKUNLP / diffusion-vs-ar
[ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"
☆85Updated 9 months ago
yihedeng9 / rlhf-summary-notes
A brief and partial summary of RLHF algorithms.
☆139Updated 8 months ago
ScalingIntelligence / large_language_monkeys
☆109Updated last year
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆240Updated 2 months ago
ML-GSAI / SMDM
Official PyTorch implementation for ICLR2025 paper "Scaling up Masked Diffusion Models on Text"
☆343Updated 11 months ago
facebookresearch / RAM
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆300Updated 3 weeks ago
yaof20 / Flash-RL
Implementation for FP8/INT8 Rollout for RL training without performence drop.
☆275Updated 3 weeks ago
agentica-project / verl-pipeline
Async pipelined version of Verl
☆125Updated 7 months ago
nil0x9 / flash-muon
Flash-Muon: An Efficient Implementation of Muon Optimizer
☆212Updated 5 months ago
ServiceNow / PipelineRL
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆316Updated this week
axon-rl / gem
A Gym for Agentic LLMs
☆371Updated 3 weeks ago
thu-ml / ReMoE
[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.
☆99Updated 11 months ago
sail-sg / Precision-RL
Defeating the Training-Inference Mismatch via FP16
☆159Updated 2 weeks ago
NVIDIA / ngpt
Normalized Transformer (nGPT)
☆194Updated last year
JinjieNi / dlms-are-super-data-learners
The official github repo for "Diffusion Language Models are Super Data Learners".
☆205Updated 3 weeks ago
jzhang38 / LongMamba
Some preliminary explorations of Mamba's context scaling.
☆217Updated last year
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆174Updated 5 months ago
shawntan / stickbreaking-attention
Stick-breaking attention
☆61Updated 5 months ago
princeton-nlp / ProLong
Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"
☆240Updated 2 months ago
ryoungj / BoLT
Code for "Reasoning to Learn from Latent Thoughts"
☆122Updated 8 months ago
Infini-AI-Lab / Multiverse
☆103Updated 2 months ago
McGill-NLP / VinePPO
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
☆181Updated 6 months ago
mnoukhov / async_rlhf
Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models
☆67Updated 7 months ago
princeton-nlp / HELMET
The HELMET Benchmark
☆186Updated 3 months ago
ypwang61 / One-Shot-RLVR
[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example
☆381Updated last week