cpldcpu / LRMTokenEconomyLinks
Measuring Thinking Efficiency in Reasoning Models - Research Repository
☆37Updated 3 weeks ago
Alternatives and similar repositories for LRMTokenEconomy
Users that are interested in LRMTokenEconomy are comparing it to the libraries listed below
Sorting:
- ☆19Updated 9 months ago
- ☆29Updated last month
- ☆36Updated 5 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 2 months ago
- Lego for GRPO☆30Updated 7 months ago
- SciAgent: A Unified Multi-Agent System for Generalistic Scientific Reasoning☆91Updated last month
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆37Updated last month
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆122Updated 2 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated 4 months ago
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago
- Official implementation of GRAPE: Group Representational Position Encoding (https://arxiv.org/abs/2512.07805)☆66Updated this week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆109Updated 9 months ago
- ☆19Updated 5 months ago
- ☆150Updated 2 weeks ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Updated 2 months ago
- ☆66Updated 9 months ago
- ☆62Updated 5 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 7 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated 3 weeks ago
- ☆68Updated 6 months ago
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1☆22Updated 2 months ago
- ☆105Updated 6 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆112Updated 8 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆249Updated this week
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated last year
- open source alpha evolve☆67Updated 7 months ago
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated last week
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆104Updated 7 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆84Updated 9 months ago
- alternative way to calculating self attention☆18Updated last year