MozerWang / promISeLinks
[COLING 2024 (Oral)] PromISe:Releasing the Capabilities of LLMs with Prompt Introspective Search
β23Updated last year
Alternatives and similar repositories for promISe
Users that are interested in promISe are comparing it to the libraries listed below
Sorting:
- (ACL 2025) π₯π₯π₯Code for "Empowering Multimodal Large Language Models with Evol-Instruct"β19Updated 7 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Modelsβ58Updated last year
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.β85Updated 10 months ago
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forge-retain Pareto Optimalityβ18Updated 2 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language β¦β38Updated 11 months ago
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agentsβ46Updated 6 months ago
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improvingβ24Updated 4 months ago
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shapingβ62Updated 7 months ago
- The reinforcement learning codes for dataset SPA-VLβ43Updated last year
- Code for Research Project TLDRβ25Updated 5 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.β28Updated 10 months ago
- Code and data for "Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?" (ACL 2024)β32Updated last year
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"β61Updated 3 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuningβ88Updated 10 months ago
- my commonly-used toolsβ63Updated last year
- Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"β20Updated 2 months ago
- Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"β24Updated last year
- [2025-TMLR] A Survey on the Honesty of Large Language Modelsβ63Updated last year
- β33Updated 7 months ago
- γICLR 2025 π₯γThe code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overcoβ¦β47Updated 9 months ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"β32Updated 9 months ago
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.β176Updated 6 months ago
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"β57Updated 8 months ago
- BeHonest: Benchmarking Honesty in Large Language Modelsβ34Updated last year
- β31Updated 4 months ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to β¦β53Updated 2 weeks ago
- [ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPOβ61Updated 8 months ago
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modelingβ20Updated last year
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.β20Updated 9 months ago
- β36Updated last year