shangshang-wang / TinaView external linksLinks
[ICLR 2026] Tina: Tiny Reasoning Models via LoRA
β320Sep 23, 2025Updated 4 months ago
Alternatives and similar repositories for Tina
Users that are interested in Tina are comparing it to the libraries listed below
Sorting:
- DPO, but faster πβ47Dec 6, 2024Updated last year
- β15Apr 26, 2025Updated 9 months ago
- Understanding R1-Zero-Like Training: A Critical Perspectiveβ1,209Aug 27, 2025Updated 5 months ago
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Exampleβ410Nov 21, 2025Updated 2 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]β218Nov 27, 2025Updated 2 months ago
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusionβ14Mar 17, 2025Updated 11 months ago
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learningβ989Sep 26, 2025Updated 4 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reiβ¦β1,322May 16, 2025Updated 9 months ago
- Reinforcing General Reasoning without Verifiersβ97Jun 24, 2025Updated 7 months ago
- A series of technical report on Slow Thinking with LLMβ759Aug 13, 2025Updated 6 months ago
- πΎ OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.β628Jan 29, 2026Updated 2 weeks ago
- β522Feb 4, 2026Updated last week
- β14Apr 14, 2025Updated 10 months ago
- π LLM-I: Transform LLMs into natural interleaved multimodal creators! β¨ Tool-use framework supporting image search, generation, code exβ¦β41Oct 20, 2025Updated 3 months ago
- β16Jul 23, 2024Updated last year
- Optimizing Anytime Reasoning via Budget Relative Policy Optimizationβ51Jul 15, 2025Updated 7 months ago
- Official Repo for Open-Reasoner-Zeroβ2,085Jun 2, 2025Updated 8 months ago
- GRadient-INformed MoEβ264Sep 25, 2024Updated last year
- Train your own SOTA deductive reasoning modelβ107Mar 6, 2025Updated 11 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ4,021Nov 13, 2025Updated 3 months ago
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"β445Oct 16, 2024Updated last year
- β67May 23, 2025Updated 8 months ago
- AllenAI's post-training codebaseβ3,573Updated this week
- An Open-source RL System from ByteDance Seed and Tsinghua AIRβ1,732May 11, 2025Updated 9 months ago
- Async RL Training at Scaleβ1,071Updated this week
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025β33May 1, 2025Updated 9 months ago
- [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillationβ71Oct 17, 2025Updated 4 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"β273Oct 16, 2025Updated 4 months ago
- Simple RL training for reasoningβ3,827Dec 23, 2025Updated last month
- β19Jan 3, 2025Updated last year
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewardsβ36Oct 3, 2025Updated 4 months ago
- Fine-tuning Quantized Neural Networks with Zeroth-order Optimizationβ15Sep 17, 2025Updated 5 months ago
- Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).β12Sep 22, 2025Updated 4 months ago
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisiβ¦β14Jun 6, 2025Updated 8 months ago
- β43Apr 22, 2025Updated 9 months ago
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Modelsβ140Dec 17, 2025Updated 2 months ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Thinkβ252Sep 26, 2025Updated 4 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]β42Aug 25, 2025Updated 5 months ago
- β40May 27, 2025Updated 8 months ago