allenai / bolmo-core
Code for Bolmo: Byteifying the Next Generation of Language Models
☆115Updated last month
Alternatives and similar repositories for bolmo-core
Users who are interested in bolmo-core are comparing it to the libraries listed below
- All information and news with respect to Falcon-H1 series☆106Updated 3 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆260Updated last week
- Data recipes and robust infrastructure for training AI agents☆84Updated last week
- Simple & Scalable Pretraining for Neural Architecture Research☆307Updated last month
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆251Updated 2 months ago
- Efficient non-uniform quantization with GPTQ for GGUF☆58Updated 4 months ago
- Official Project Page for Deep Delta Learning (https://huggingface.co/papers/2601.00417)☆320Updated this week
- Digital Red Queen: Adversarial Program Evolution in Core War with LLMs☆173Updated 2 weeks ago
- Official JAX implementation of End-to-End Test-Time Training for Long Context☆478Updated 2 weeks ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- ☆95Updated last week
- Pivotal Token Search☆144Updated last month
- [ICLR'26] The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"☆324Updated this week
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective☆226Updated this week
- ☆29Updated 2 months ago
- ☆62Updated 6 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆115Updated 9 months ago
- LIMI: Less is More for Agency☆159Updated 3 months ago
- GRadient-INformed MoE☆264Updated last year
- Train, tune, and infer Bamba model☆138Updated 7 months ago
- Official Project Page for Web World Models (https://arxiv.org/abs/2512.23676)☆80Updated 3 weeks ago
- Implementation of the paper "Improving Multi-step RAG with Hypergraph-based Memory for Long-context Complex Relational Modeling"☆103Updated last week
- accompanying material for sleep-time compute paper☆119Updated 9 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- Marketplace ML experiment - training without backprop☆27Updated 4 months ago
- Stable-DiffCoder is a family of lightweight open-source code DLLMs (diffusion large language models) comprising base and instruct models, …☆50Updated last week
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆288Updated 2 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆88Updated 10 months ago
- ☆159Updated last month