ritzz-ai / PACS
☆20 · Updated this week
Alternatives and similar repositories for PACS
Users who are interested in PACS are comparing it to the repositories listed below
- ☆30 · Updated last month
- ☆25 · Updated last week
- SIFT: Grounding LLM Reasoning in Contexts via Stickers ☆58 · Updated 5 months ago
- JudgeLRM: Large Reasoning Models as a Judge ☆35 · Updated 4 months ago
- (ACL 2025 oral) SCOPE: Optimizing KV Cache Compression in Long-context Generation ☆32 · Updated 3 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆20 · Updated 3 weeks ago
- ☆29 · Updated 3 weeks ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization ☆61 · Updated last week
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training ☆34 · Updated last week
- Code for paper: Optimizing Length Compression in Large Reasoning Models ☆23 · Updated 2 months ago
- Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning" ☆21 · Updated 3 months ago
- ☆14 · Updated 8 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions ☆22 · Updated last month
- ☆34 · Updated 3 weeks ago
- ARM: Adaptive Reasoning Model ☆47 · Updated last month
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity ☆17 · Updated last week
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models ☆40 · Updated 3 weeks ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments ☆29 · Updated 3 weeks ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆85 · Updated 6 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning ☆87 · Updated 6 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework ☆70 · Updated 3 months ago
- ☆15 · Updated 11 months ago
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation" ☆25 · Updated 2 months ago
- ☆27 · Updated 3 months ago
- ☆23 · Updated 3 months ago
- ☆130 · Updated 2 weeks ago
- ☆18 · Updated 2 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression ☆93 · Updated 4 months ago
- SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks ☆90 · Updated 2 months ago
- Segment Policy Optimization: Improved Credit Assignment in Reinforcement Learning for LLMs ☆29 · Updated last month