long-horizon-execution / measuring-executionLinks
☆49Updated 3 months ago
Alternatives and similar repositories for measuring-execution
Users that are interested in measuring-execution are comparing it to the libraries listed below
Sorting:
- ☆29Updated last month
- Esoteric Language Models☆108Updated last month
- Official Repository of Native Parallel Reasoner☆92Updated 3 weeks ago
- ☆109Updated 3 months ago
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆63Updated this week
- Official repo of paper LM2☆46Updated 10 months ago
- [NeurIPS'25 Oral] Query-agnostic KV cache eviction: 3–4× reduction in memory and 2× decrease in latency (Qwen3/2.5, Gemma3, LLaMA3)☆176Updated last week
- Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models☆228Updated 2 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆284Updated last month
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆137Updated 4 months ago
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆165Updated last month
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated 4 months ago
- Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More☆34Updated 7 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆85Updated 9 months ago
- ☆52Updated 6 months ago
- SSRL: Self-Search Reinforcement Learning☆201Updated 4 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆68Updated 8 months ago
- Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models☆41Updated 8 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆41Updated last week
- ☆38Updated last year
- ☆365Updated 2 months ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆215Updated 2 months ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆56Updated 2 weeks ago
- ☆41Updated 7 months ago
- ☆152Updated 3 months ago
- Official JAX implementation of End-to-End Test-Time Training for Long Context☆214Updated last week
- ☆91Updated last year
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆87Updated 2 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆250Updated this week
- ☆19Updated 10 months ago