facebookresearch / large_concept_model
Large Concept Models: Language modeling in a sentence representation space
☆1,713Updated this week
Alternatives and similar repositories for large_concept_model:
Users that are interested in large_concept_model are comparing it to the libraries listed below
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.☆625Updated last month
- NanoGPT (124M) in 3.4 minutes☆2,068Updated last week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆831Updated last month
- Code for BLT research paper☆1,314Updated this week
- Recipes to scale inference-time compute of open models☆932Updated this week
- ☆996Updated last month
- nanoGPT style version of Llama 3.1☆1,290Updated 5 months ago
- Everything about the SmolLM & SmolLM2 family of models☆1,554Updated last week
- Bringing BERT into modernity via both architecture changes and scaling☆1,045Updated last week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.☆1,992Updated last month
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,093Updated 3 weeks ago
- ReFT: Representation Finetuning for Language Models☆1,373Updated 2 weeks ago
- A reading list on LLM based Synthetic Data Generation 🔥☆969Updated 2 months ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆589Updated this week
- ☆2,289Updated this week
- Automated Design of Agentic Systems☆1,135Updated last week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,399Updated this week
- ☆2,802Updated 4 months ago
- The code used to train and run inference with the ColPali architecture.☆1,386Updated this week
- Sky-T1: Train your own O1 preview model within $450☆1,795Updated this week
- System 2 Reasoning Link Collection☆722Updated this week
- ☆2,180Updated last week
- A bibliography and survey of the papers surrounding o1☆1,042Updated 2 months ago
- An Open Large Reasoning Model for Real-World Solutions☆1,378Updated last month
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,217Updated last month
- Entropy Based Sampling and Parallel CoT Decoding☆3,197Updated 2 months ago
- LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning☆1,739Updated last week
- A PyTorch native library for large model training☆3,091Updated this week
- The n-gram Language Model☆1,363Updated 5 months ago
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,462Updated this week