facebookresearch / large_concept_model
Large Concept Models: Language modeling in a sentence representation space
☆1,927 Updated 3 weeks ago
Alternatives and similar repositories for large_concept_model:
Users who are interested in large_concept_model compare it to the libraries listed below.
- A Self-adaptation Framework that adapts LLMs for unseen tasks in real-time! ☆948 Updated 3 weeks ago
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders. ☆685 Updated 2 months ago
- Code for BLT research paper ☆1,400 Updated this week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. ☆2,085 Updated 3 weeks ago
- Training Large Language Model to Reason in a Continuous Latent Space ☆877 Updated 3 weeks ago
- Bringing BERT into modernity via both architecture changes and scaling ☆1,199 Updated last week
- Sky-T1: Train your own O1 preview model within $450 ☆2,641 Updated this week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling" ☆841 Updated this week
- NanoGPT (124M) in 3 minutes ☆2,294 Updated this week
- Recipes to scale inference-time compute of open models ☆1,002 Updated last month
- nanoGPT style version of Llama 3.1 ☆1,316 Updated 6 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… ☆1,019 Updated last month
- [ICLR 2025] Automated Design of Agentic Systems ☆1,188 Updated 3 weeks ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,448 Updated this week
- Everything about the SmolLM2 and SmolVLM family of models ☆1,888 Updated 2 weeks ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,160 Updated this week
- System 2 Reasoning Link Collection ☆794 Updated 2 weeks ago
- ☆1,006 Updated 2 months ago
- PyTorch native post-training library ☆4,856 Updated this week
- AllenAI's post-training codebase ☆2,657 Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs ☆3,387 Updated this week
- Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch ☆1,079 Updated last week
- A reading list on LLM based Synthetic Data Generation ☆1,141 Updated 3 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. ☆1,187 Updated 2 weeks ago
- Optimizing inference proxy for LLMs ☆2,040 Updated this week
- ☆1,480 Updated this week
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. ☆1,930 Updated 6 months ago