facebookresearch / large_concept_modelLinks
Large Concept Models: Language modeling in a sentence representation space
☆2,240Updated 5 months ago
Alternatives and similar repositories for large_concept_model
Users that are interested in large_concept_model are comparing it to the libraries listed below
Sorting:
- Code for BLT research paper☆1,725Updated last month
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.☆787Updated 3 months ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,123Updated 5 months ago
- Sky-T1: Train your own O1 preview model within $450☆3,300Updated this week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.☆2,719Updated last week
- Training Large Language Model to Reason in a Continuous Latent Space☆1,185Updated 5 months ago
- Recipes to scale inference-time compute of open models☆1,101Updated last month
- Bringing BERT into modernity via both architecture changes and scaling☆1,430Updated last week
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,469Updated 2 months ago
- Stanford NLP Python library for Representation Finetuning (ReFT)☆1,493Updated 5 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,722Updated this week
- Tool for generating high quality Synthetic datasets☆1,010Updated this week
- Everything about the SmolLM and SmolVLM family of models☆2,803Updated this week
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆888Updated 2 months ago
- AllenAI's post-training codebase☆3,044Updated this week
- ☆2,136Updated this week
- A reading list on LLM based Synthetic Data Generation 🔥☆1,338Updated last month
- Synthetic data curation for post-training and structured data extraction☆1,434Updated this week
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆3,677Updated this week
- Pretraining code for a large-scale depth-recurrent language model☆793Updated last month
- ☆1,025Updated 6 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,507Updated last week
- Verifiers for LLM Reinforcement Learning☆1,495Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,800Updated this week
- Democratizing Reinforcement Learning for LLMs☆3,744Updated this week
- nanoGPT style version of Llama 3.1☆1,394Updated 11 months ago
- Official PyTorch implementation for "Large Language Diffusion Models"☆2,530Updated 3 weeks ago
- NanoGPT (124M) in 3 minutes☆2,774Updated 3 weeks ago
- ☆673Updated 2 months ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆795Updated 3 weeks ago