ritzz-ai / PACSView external linksLinks
☆31Sep 12, 2025Updated 5 months ago
Alternatives and similar repositories for PACS
Users that are interested in PACS are comparing it to the libraries listed below
Sorting:
- ☆60Jan 12, 2026Updated last month
- ☆24Aug 19, 2025Updated 5 months ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Updated this week
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆47Oct 16, 2025Updated 3 months ago
- ☆16May 21, 2025Updated 8 months ago
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆22Jul 1, 2025Updated 7 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models☆59Jul 23, 2024Updated last year
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Feb 29, 2024Updated last year
- (ACL 2025) 🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"☆20May 15, 2025Updated 8 months ago
- Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆29Sep 19, 2025Updated 4 months ago
- Sotopia-RL: Reward Design for Social Intelligence☆46Jan 29, 2026Updated 2 weeks ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆41Sep 30, 2024Updated last year
- instruction-following benchmark for large reasoning models☆44Aug 9, 2025Updated 6 months ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆45Dec 25, 2025Updated last month
- ☆17Jul 5, 2022Updated 3 years ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Oct 9, 2025Updated 4 months ago
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆21Dec 16, 2024Updated last year
- [ICLR 2026] Adaptive Social Learning via Mode Policy Optimization for Language Agents☆48Feb 2, 2026Updated last week
- ☆72Jun 10, 2025Updated 8 months ago
- Hands-On Image Processing with Python, Second Edition, Published by Packt☆26Updated this week
- ☆17May 19, 2023Updated 2 years ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆81Dec 25, 2025Updated last month
- From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systems☆17Nov 23, 2025Updated 2 months ago
- ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]☆20Aug 21, 2025Updated 5 months ago
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆23Mar 29, 2025Updated 10 months ago
- [COLING 2024 (Oral)] PromISe:Releasing the Capabilities of LLMs with Prompt Introspective Search☆23Aug 26, 2024Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆27May 13, 2025Updated 9 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆32Jul 25, 2025Updated 6 months ago
- ☆15Feb 21, 2024Updated last year
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆29Jan 10, 2026Updated last month
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆78Nov 25, 2024Updated last year
- a survey on deep research☆47Sep 9, 2025Updated 5 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 2 months ago
- ☆33Jul 15, 2025Updated 6 months ago
- REverse-Engineered Reasoning for Open-Ended Generation☆91Sep 10, 2025Updated 5 months ago
- ☆21Dec 6, 2025Updated 2 months ago
- Test-time Scaling for VAR models☆31Sep 19, 2025Updated 4 months ago