facebookresearch / large_concept_modelLinks
Large Concept Models: Language modeling in a sentence representation space
β2,291Updated 8 months ago
Alternatives and similar repositories for large_concept_model
Users that are interested in large_concept_model are comparing it to the libraries listed below
Sorting:
- Code for BLT research paperβ1,995Updated 5 months ago
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,155Updated 8 months ago
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.β826Updated last week
- Training Large Language Model to Reason in a Continuous Latent Spaceβ1,297Updated 2 months ago
- Recipes to scale inference-time compute of open modelsβ1,111Updated 5 months ago
- Stanford NLP Python library for Representation Finetuning (ReFT)β1,514Updated 8 months ago
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.β3,012Updated 2 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ2,021Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,903Updated this week
- Bringing BERT into modernity via both architecture changes and scalingβ1,546Updated 3 months ago
- Sky-T1: Train your own O1 preview model within $450β3,341Updated 3 months ago
- AllenAI's post-training codebaseβ3,252Updated this week
- Everything about the SmolLM and SmolVLM family of modelsβ3,332Updated last month
- Official PyTorch implementation for "Large Language Diffusion Models"β3,079Updated last week
- β2,385Updated last week
- β1,035Updated 10 months ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modelingβ915Updated 5 months ago
- Pretraining and inference code for a large-scale depth-recurrent language modelβ836Updated last week
- Environments for LLM Reinforcement Learningβ3,338Updated this week
- Tool for generating high quality Synthetic datasetsβ1,306Updated 3 weeks ago
- Minimalistic large language model 3D-parallelism trainingβ2,267Updated last month
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'β1,599Updated 8 months ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineeringβ1,029Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratchβ1,621Updated 6 months ago
- Synthetic data curation for post-training and structured data extractionβ1,535Updated 2 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.β2,673Updated last week
- β684Updated 5 months ago
- OLMoE: Open Mixture-of-Experts Language Modelsβ886Updated last month
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"β1,362Updated 8 months ago
- [COLM 2025] LIMO: Less is More for Reasoningβ1,037Updated 2 months ago