cpldcpu / LRMTokenEconomyLinks
Measuring Thinking Efficiency in Reasoning Models - Research Repository
☆37Updated last month
Alternatives and similar repositories for LRMTokenEconomy
Users that are interested in LRMTokenEconomy are comparing it to the libraries listed below
Sorting:
- ☆19Updated 8 months ago
- Lego for GRPO☆30Updated 5 months ago
- ☆29Updated last week
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated 2 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆165Updated 2 months ago
- ☆36Updated 3 months ago
- ☆62Updated 4 months ago
- ☆18Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated last month
- ☆77Updated last week
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Updated last month
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆15Updated 3 weeks ago
- ☆62Updated 4 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated last year
- Efficient non-uniform quantization with GPTQ for GGUF☆53Updated 2 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆82Updated 7 months ago
- Marketplace ML experiment - training without backprop☆27Updated 2 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆53Updated 9 months ago
- ☆18Updated 11 months ago
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code merges☆53Updated 4 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆29Updated 11 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Updated last year
- Train, tune, and infer Bamba model☆136Updated 5 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 6 months ago
- ☆21Updated 3 months ago
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆21Updated 4 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆35Updated last month
- Train your own SOTA deductive reasoning model☆108Updated 8 months ago
- craft post-training data recipes☆60Updated this week