MozerWang / promISeLinks
[COLING 2024 (Oral)] PromISe:Releasing the Capabilities of LLMs with Prompt Introspective Search
β23Updated 11 months ago
Alternatives and similar repositories for promISe
Users that are interested in promISe are comparing it to the libraries listed below
Sorting:
- (ACL 2025) π₯π₯π₯Code for "Empowering Multimodal Large Language Models with Evol-Instruct"β17Updated 2 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.β78Updated 5 months ago
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agentsβ39Updated last month
- Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"β20Updated 8 months ago
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.β161Updated last month
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"β27Updated 2 weeks ago
- my commonly-used toolsβ56Updated 7 months ago
- γICLR 2025 π₯γThe code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overcoβ¦β44Updated 4 months ago
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modelingβ16Updated 7 months ago
- The reinforcement learning codes for dataset SPA-VLβ36Updated last year
- β78Updated last year
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"β26Updated 2 months ago
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shapingβ52Updated 2 months ago
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"β50Updated 3 months ago
- instruction-following benchmark for large reasoning modelsβ36Updated 2 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language β¦β35Updated 6 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Modelsβ58Updated 8 months ago
- A Self-Training Framework for Vision-Language Reasoningβ80Updated 6 months ago
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"β33Updated last year
- β103Updated last month
- More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Modelsβ43Updated 2 months ago
- (ICLR2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.β37Updated last month
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Modelsβ56Updated last year
- CoT-Valve: Length-Compressible Chain-of-Thought Tuningβ81Updated 5 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.β24Updated 5 months ago
- [ICML 2025] Official Implementation of GLIDERβ51Updated 2 months ago
- β34Updated 10 months ago
- β46Updated 4 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!β47Updated 4 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoningβ64Updated 3 weeks ago