cpldcpu / LRMTokenEconomyLinks
Measuring Thinking Efficiency in Reasoning Models - Research Repository
☆37Updated last week
Alternatives and similar repositories for LRMTokenEconomy
Users that are interested in LRMTokenEconomy are comparing it to the libraries listed below
Sorting:
- ☆29Updated last month
- ☆19Updated 9 months ago
- Lego for GRPO☆30Updated 6 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆100Updated 3 months ago
- ☆36Updated 4 months ago
- ☆62Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆239Updated this week
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆167Updated 3 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆54Updated 10 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆36Updated 3 weeks ago
- ☆18Updated 4 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆15Updated last week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated last month
- EvaByte: Efficient Byte-level Language Models at Scale☆111Updated 7 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆108Updated 9 months ago
- open source alpha evolve☆66Updated 6 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Updated 2 months ago
- ☆21Updated 4 months ago
- SciAgent: A Unified Multi-Agent System for Generalistic Scientific Reasoning☆89Updated 3 weeks ago
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆45Updated 4 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆84Updated 8 months ago
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 5 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated last year
- AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents☆34Updated 2 months ago
- ☆68Updated 6 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Updated last year
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 7 months ago
- ☆88Updated last month