facebookresearch / bltLinks

Code for BLT research paper

☆2,010

Alternatives and similar repositories for blt

Users that are interested in blt are comparing it to the libraries listed below

Sorting:

facebookresearch / large_concept_model
Large Concept Models: Language modeling in a sentence representation space
☆2,308Updated 10 months ago
SakanaAI / self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,172Updated 10 months ago
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆932Updated 2 weeks ago
facebookresearch / coconut
Training Large Language Model to Reason in a Continuous Latent Space
☆1,367Updated 3 months ago
lucidrains / titans-pytorch
Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch
☆1,533Updated this week
KellerJordan / Muon
Muon is an optimizer for hidden layers in neural networks
☆2,056Updated last week
facebookresearch / SONAR
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
☆845Updated last month
seal-rg / recurrent-pretraining
Pretraining and inference code for a large-scale depth-recurrent language model
☆850Updated last month
huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,118Updated 6 months ago
policy-gradient / GRPO-Zero
Implementing DeepSeek R1's GRPO algorithm from scratch
☆1,682Updated 7 months ago
huggingface / nanotron
Minimalistic large language model 3D-parallelism training
☆2,351Updated last week
ML-GSAI / LLaDA
Official PyTorch implementation for "Large Language Diffusion Models"
☆3,333Updated 3 weeks ago
DreamLM / Dream
Dream 7B, a large diffusion language model
☆1,094Updated last week
goombalab / hnet
H-Net: Hierarchical Network with Dynamic Chunking
☆785Updated last week
huggingface / picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
☆1,901Updated 3 months ago
AnswerDotAI / ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
☆1,572Updated 5 months ago
open-thought / reasoning-gym
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
☆1,242Updated 2 weeks ago
huggingface / lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
☆2,141Updated last week
SakanaAI / continuous-thought-machines
Continuous Thought Machines, because thought takes time and reasoning is a process.
☆1,530Updated last month
stanfordnlp / pyreft
Stanford NLP Python library for Representation Finetuning (ReFT)
☆1,538Updated 9 months ago
KellerJordan / modded-nanogpt
NanoGPT (124M) in 3 minutes
☆3,911Updated last week
natolambert / rlhf-book
Textbook on reinforcement learning from human feedback
☆1,338Updated this week
MoonshotAI / Moonlight
Muon is Scalable for LLM Training
☆1,372Updated 4 months ago
allenai / OLMoE
OLMoE: Open Mixture-of-Experts Language Models
☆916Updated 2 months ago
SonyResearch / micro_diffusion
Official repository for our work on micro-budget training of large-scale diffusion models.
☆1,534Updated 10 months ago
ZHZisZZ / dllm
dLLM: Simple Diffusion Language Modeling
☆1,069Updated this week
Haiyang-W / TokenFormer
[ICLR2025 Spotlight🔥] Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
☆580Updated 9 months ago
huggingface / nanoVLM
The simplest, fastest repository for training/finetuning small-sized VLMs.
☆4,331Updated last month
trotsky1997 / MathBlackBox
☆1,035Updated 11 months ago
open-thought / system-2-research
System 2 Reasoning Link Collection
☆859Updated 8 months ago