Evanwu1125 / LiteCoT
☆14 Updated 6 months ago
Alternatives and similar repositories for LiteCoT
Users who are interested in LiteCoT are comparing it to the repositories listed below.
- ☆38 Updated 4 months ago
- ☆53 Updated 2 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆25 Updated 4 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025. ☆27 Updated 10 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA. ☆80 Updated last month
- ☆30 Updated last month
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents ☆46 Updated 5 months ago
- ☆46 Updated 2 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression ☆124 Updated 8 months ago
- ☆69 Updated 6 months ago
- [ACL 2025 Findings] Official implementation of the paper "Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning". ☆19 Updated 9 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆88 Updated 10 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆92 Updated last month
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning ☆66 Updated 6 months ago
- ☆32 Updated 5 months ago
- ☆173 Updated 2 weeks ago
- TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25