facebookresearch / large_concept_modelLinks
Large Concept Models: Language modeling in a sentence representation space
β2,324Updated 11 months ago
Alternatives and similar repositories for large_concept_model
Users that are interested in large_concept_model are comparing it to the libraries listed below
Sorting:
- Code for BLT research paperβ2,024Updated 2 months ago
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,179Updated 11 months ago
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.β856Updated 3 months ago
- Sky-T1: Train your own O1 preview model within $450β3,367Updated 6 months ago
- AllenAI's post-training codebaseβ3,515Updated this week
- Training Large Language Model to Reason in a Continuous Latent Spaceβ1,449Updated 5 months ago
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modelingβ939Updated last month
- Stanford NLP Python library for Representation Finetuning (ReFT)β1,551Updated this week
- β2,529Updated this week
- Recipes to scale inference-time compute of open modelsβ1,123Updated 7 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β3,039Updated 3 weeks ago
- Tool for generating high quality Synthetic datasetsβ1,463Updated 2 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ2,251Updated this week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.β3,266Updated 5 months ago
- β1,032Updated last year
- Bringing BERT into modernity via both architecture changes and scalingβ1,607Updated 6 months ago
- A library for advanced large language model reasoningβ2,319Updated 7 months ago
- Our library for RL environments + evalsβ3,730Updated this week
- Pretraining and inference code for a large-scale depth-recurrent language modelβ859Updated 2 weeks ago
- Synthetic data curation for post-training and structured data extractionβ1,594Updated last week
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorchβ1,864Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratchβ1,740Updated 8 months ago
- Everything about the SmolLM and SmolVLM family of modelsβ3,552Updated last month
- Official PyTorch implementation for "Large Language Diffusion Models"β3,473Updated 2 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.β2,812Updated this week
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ2,137Updated last year
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineeringβ1,270Updated 3 weeks ago
- Textbook on reinforcement learning from human feedbackβ1,396Updated this week
- Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'β1,637Updated last month
- Democratizing Reinforcement Learning for LLMsβ4,965Updated this week