MozerWang / promISe
[COLING 2024 (Oral)] PromISe: Releasing the Capabilities of LLMs with Prompt Introspective Search
☆23 Updated last year
Alternatives and similar repositories for promISe
Users who are interested in promISe are comparing it to the libraries listed below.
- (ACL 2025) 🔥🔥🔥 Code for "Empowering Multimodal Large Language Models with Evol-Instruct" ☆18 Updated 5 months ago
- [arXiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents ☆46 Updated 4 months ago
- Official code and data for the ACL 2024 Findings paper "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models" ☆23 Updated 11 months ago
- [ACL '25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models. ☆83 Updated 8 months ago
- ☆18 Updated 3 weeks ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models" ☆26 Updated 5 months ago
- The official GitHub repository of the paper "Recent advances in large language model benchmarks against data contamination: From static t…" ☆45 Updated last month
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning" ☆30 Updated 3 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning ☆86 Updated 8 months ago
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling ☆20 Updated 10 months ago
- Official code for our paper "The Hallucination Dilemma: Factuality-Aware Reinforcement Learning for Large Reasoning Models" ☆16 Updated 5 months ago
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability" ☆33 Updated last year
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping ☆55 Updated 5 months ago
- ACL 2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs, and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of… ☆57 Updated 5 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆69 Updated 3 months ago
- ☆46 Updated 6 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution" ☆47 Updated last month
- AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning ☆46 Updated 4 months ago
- Code for the ACL 2024 accepted paper "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …" ☆36 Updated 9 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models ☆57 Updated last year
- Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024) ☆24 Updated last year
- Instruction-following benchmark for large reasoning models ☆45 Updated 2 months ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency" ☆31 Updated 6 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models ☆61 Updated 10 months ago
- Code for Research Project TLDR ☆23 Updated 3 months ago
- This repository is continuously updated with the latest papers, technical reports, and benchmarks about multimodal reasoning! ☆54 Updated 7 months ago
- [EMNLP 2024 Findings 🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In…" ☆103 Updated 11 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"? ☆36 Updated 4 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025. ☆27 Updated 8 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ☆95 Updated last month