tiiuae / Falcon-H1
All information and news about the Falcon-H1 series
☆95 Updated 3 months ago
Alternatives and similar repositories for Falcon-H1
Users who are interested in Falcon-H1 are comparing it to the repositories listed below
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache ☆137 Updated 4 months ago
- Simple & Scalable Pretraining for Neural Architecture Research ☆306 Updated last month
- Accompanying material for the sleep-time compute paper ☆118 Updated 8 months ago
- EvaByte: Efficient Byte-level Language Models at Scale ☆114 Updated 8 months ago
- Train, tune, and run inference with the Bamba model ☆137 Updated 7 months ago
- Esoteric Language Models ☆108 Updated last month
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows. ☆450 Updated 2 weeks ago
- PyTorch implementation of models from the Zamba2 series. ☆186 Updated 11 months ago
- RLP: Reinforcement as a Pretraining Objective ☆222 Updated 3 months ago
- Official JAX implementation of End-to-End Test-Time Training for Long Context ☆214 Updated last week
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆174 Updated 11 months ago
- Memory-optimized Mixture of Experts ☆72 Updated 5 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆125 Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 Updated 2 months ago
- Official Repository of Native Parallel Reasoner ☆92 Updated 3 weeks ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆250 Updated this week
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training ☆45 Updated 5 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning" ☆284 Updated last month
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM). ☆340 Updated 3 weeks ago
- Lightweight toolkit package to train and fine-tune 1.58-bit language models ☆104 Updated 7 months ago
- The official repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning" ☆250 Updated last month
- Universal Reasoning Model ☆113 Updated 2 weeks ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code ☆86 Updated this week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆85 Updated 9 months ago
- QeRL enables RL for 32B LLMs on a single H100 GPU. ☆469 Updated last month
- [EMNLP 2025] The official implementation for the paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆101 Updated 4 months ago
- PyTorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support ☆232 Updated this week
- An open-source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆109 Updated 10 months ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r… ☆305 Updated 3 weeks ago
- Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models ☆228 Updated 2 months ago