cpldcpu / LRMTokenEconomyLinks
Measuring Thinking Efficiency in Reasoning Models - Research Repository
☆37Updated last week
Alternatives and similar repositories for LRMTokenEconomy
Users that are interested in LRMTokenEconomy are comparing it to the libraries listed below
Sorting:
- Lego for GRPO☆29Updated 4 months ago
- ☆27Updated 3 months ago
- ☆36Updated 2 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 8 months ago
- ☆19Updated 7 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆100Updated last month
- ☆17Updated 2 months ago
- Train your own SOTA deductive reasoning model☆107Updated 7 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆77Updated 6 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 11 months ago
- ☆62Updated 2 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆94Updated last week
- ☆68Updated 4 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆105Updated 7 months ago
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 3 months ago
- Marketplace ML experiment - training without backprop☆25Updated 3 weeks ago
- ☆22Updated 2 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 5 months ago
- Repository to create traveling waves integrate special information through time☆55Updated last month
- entropix style sampling + GUI☆27Updated 11 months ago
- ☆98Updated 3 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆51Updated 7 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆89Updated 4 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆187Updated last month
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆43Updated 2 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆109Updated 5 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆99Updated 2 months ago
- ☆13Updated 5 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated 11 months ago