subconscious-systems / TIMRUNLinks
☆52Updated 3 weeks ago
Alternatives and similar repositories for TIMRUN
Users that are interested in TIMRUN are comparing it to the libraries listed below
Sorting:
- Esoteric Language Models☆96Updated last month
- ☆85Updated last year
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆96Updated 2 weeks ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆105Updated 3 months ago
- ☆69Updated 11 months ago
- ☆104Updated 11 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆48Updated 4 months ago
- ☆54Updated 3 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆64Updated 5 months ago
- Memory optimized Mixture of Experts☆62Updated last month
- A repository for research on medium sized language models.☆77Updated last year
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆161Updated 5 months ago
- ☆27Updated 2 months ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation☆51Updated last week
- ☆51Updated 2 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆30Updated last month
- Here we will test various linear attention designs.☆62Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆34Updated 3 weeks ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 3 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆93Updated 3 months ago
- The official implementation of Self-Exploring Language Models (SELM)☆64Updated last year
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆40Updated last month
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆47Updated last month
- ☆19Updated 6 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆56Updated last week
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Updated 11 months ago
- Resa: Transparent Reasoning Models via SAEs☆41Updated last month
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study☆54Updated 9 months ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆17Updated 6 months ago
- SSRL: Self-Search Reinforcement Learning☆131Updated 3 weeks ago