facebookresearch / large_concept_modelLinks
Large Concept Models: Language modeling in a sentence representation space
β2,206Updated 4 months ago
Alternatives and similar repositories for large_concept_model
Users that are interested in large_concept_model are comparing it to the libraries listed below
Sorting:
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.β765Updated 2 months ago
- A Self-adaptation Frameworkπ that adapts LLMs for unseen tasks in real-time!β1,066Updated 4 months ago
- Code for BLT research paperβ1,664Updated last week
- Training Large Language Model to Reason in a Continuous Latent Spaceβ1,135Updated 4 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backendsβ1,563Updated last week
- β991Updated this week
- A reading list on LLM based Synthetic Data Generation π₯β1,280Updated 2 weeks ago
- [ICLR 2025] Automated Design of Agentic Systemsβ1,311Updated 4 months ago
- Recipes to scale inference-time compute of open modelsβ1,087Updated last week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.β2,591Updated 2 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.β3,003Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratchβ1,390Updated last month
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modelingβ876Updated last month
- Tool for generating high quality Synthetic datasetsβ878Updated last week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,724Updated this week
- System 2 Reasoning Link Collectionβ834Updated 2 months ago
- Pretraining code for a large-scale depth-recurrent language modelβ770Updated this week
- A library for advanced large language model reasoningβ2,132Updated last month
- LIMO: Less is More for Reasoningβ953Updated last month
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard aβ¦β1,385Updated 4 months ago
- Sky-T1: Train your own O1 preview model within $450β3,254Updated 2 weeks ago
- nanoGPT style version of Llama 3.1β1,372Updated 9 months ago
- NanoGPT (124M) in 3 minutesβ2,600Updated this week
- PyTorch native post-training libraryβ5,217Updated last week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineeringβ728Updated 2 weeks ago
- Verifiers for LLM Reinforcement Learningβ1,057Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.β2,390Updated this week
- β1,024Updated 5 months ago
- An Open Large Reasoning Model for Real-World Solutionsβ1,494Updated this week
- β3,553Updated 2 weeks ago