facebookresearch / large_concept_model

Large Concept Models: Language modeling in a sentence representation space

☆2,254

Alternatives and similar repositories for large_concept_model

Users who are interested in large_concept_model are comparing it to the libraries listed below.

facebookresearch / blt
Code for BLT research paper
☆1,760 Updated 2 months ago
facebookresearch / SONAR
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.
☆792 Updated this week
SakanaAI / self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,131 Updated 6 months ago
facebookresearch / coconut
Training Large Language Model to Reason in a Continuous Latent Space
☆1,216 Updated 6 months ago
NovaSky-AI / SkyThought
Sky-T1: Train your own O1 preview model within $450
☆3,313 Updated 3 weeks ago
huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,110 Updated 2 months ago
zou-group / textgrad
TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.
☆2,796 Updated last week
stanfordnlp / pyreft
Stanford NLP Python library for Representation Finetuning (ReFT)
☆1,500 Updated 5 months ago
huggingface / lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
☆1,766 Updated this week
huggingface / smollm
Everything about the SmolLM and SmolVLM family of models
☆3,032 Updated this week
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,821 Updated this week
maitrix-org / llm-reasoners
A library for advanced large language model reasoning
☆2,193 Updated last month
policy-gradient / GRPO-Zero
Implementing DeepSeek R1's GRPO algorithm from scratch
☆1,496 Updated 3 months ago
allenai / open-instruct
AllenAI's post-training codebase
☆3,083 Updated this week
seal-rg / recurrent-pretraining
Pretraining and inference code for a large-scale depth-recurrent language model
☆806 Updated 2 weeks ago
wasiahmad / Awesome-LLM-Synthetic-Data
A reading list on LLM based Synthetic Data Generation 🔥
☆1,369 Updated last month
bespokelabsai / curator
Synthetic data curation for post-training and structured data extraction
☆1,464 Updated 3 weeks ago
AnswerDotAI / ModernBERT
Bringing BERT into modernity via both architecture changes and scaling
☆1,469 Updated last month
willccbb / verifiers
Verifiers for LLM Reinforcement Learning
☆1,621 Updated this week
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆899 Updated 3 months ago
safety-research / circuit-tracer
☆2,212 Updated this week
GAIR-NLP / LIMO
[COLM 2025] LIMO: Less is More for Reasoning
☆986 Updated 3 weeks ago
natolambert / rlhf-book
Textbook on reinforcement learning from human feedback
☆1,137 Updated 2 weeks ago
ShengranHu / ADAS
[ICLR 2025] Automated Design of Agentic Systems
☆1,392 Updated 6 months ago
SakanaAI / continuous-thought-machines
Continuous Thought Machines, because thought takes time and reasoning is a process.
☆1,223 Updated 2 weeks ago
KellerJordan / modded-nanogpt
NanoGPT (124M) in 3 minutes
☆2,965 Updated 2 weeks ago
openai / mle-bench
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
☆815 Updated last month
trotsky1997 / MathBlackBox
☆1,028 Updated 7 months ago
ML-GSAI / LLaDA
Official PyTorch implementation for "Large Language Diffusion Models"
☆2,630 Updated last month