cpldcpu / LRMTokenEconomyLinks
Measuring Thinking Efficiency in Reasoning Models - Research Repository
☆37Updated 3 weeks ago
Alternatives and similar repositories for LRMTokenEconomy
Users that are interested in LRMTokenEconomy are comparing it to the libraries listed below
Sorting:
- ☆18Updated 3 months ago
- ☆27Updated 4 months ago
- ☆22Updated 3 months ago
- Lego for GRPO☆30Updated 5 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆161Updated 2 months ago
- ☆58Updated 5 months ago
- ☆19Updated 7 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆79Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated last week
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated last month
- ☆60Updated 4 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆97Updated last week
- ☆36Updated 2 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 5 months ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆45Updated 3 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated last year
- ☆15Updated 4 months ago
- ☆62Updated 3 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆127Updated 2 months ago
- Sparse Inferencing for transformer based LLMs☆201Updated 2 months ago
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated 2 weeks ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆103Updated 2 weeks ago
- Train, tune, and infer Bamba model☆135Updated 4 months ago
- Train your own SOTA deductive reasoning model☆109Updated 7 months ago
- open source alpha evolve☆66Updated 5 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆107Updated 7 months ago
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆32Updated 3 weeks ago
- LongCodeZip: Compress Long Context for Code Language Models [ASE2025]☆110Updated last week
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆52Updated 8 months ago