MozerWang / promISeLinks
[COLING 2024 (Oral)] PromISe:Releasing the Capabilities of LLMs with Prompt Introspective Search
β23Updated last year
Alternatives and similar repositories for promISe
Users that are interested in promISe are comparing it to the libraries listed below
Sorting:
- (ACL 2025) π₯π₯π₯Code for "Empowering Multimodal Large Language Models with Evol-Instruct"β18Updated 6 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.β84Updated 9 months ago
- The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static tβ¦β47Updated 2 months ago
- Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"β19Updated last month
- β18Updated last month
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agentsβ46Updated 4 months ago
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shapingβ60Updated 6 months ago
- Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"β23Updated last year
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modelingβ20Updated 11 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language β¦β36Updated 10 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!β52Updated 8 months ago
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.β177Updated 4 months ago
- The reinforcement learning codes for dataset SPA-VLβ42Updated last year
- Code for Research Project TLDRβ24Updated 4 months ago
- γICLR 2025 π₯γThe code for Consistent In-Context Editing, an approach for tuning language models through contextual distributions, overcoβ¦β45Updated 7 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuningβ87Updated 9 months ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-ofβ¦β64Updated 6 months ago
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.β83Updated last year
- β37Updated last year
- β29Updated 5 months ago
- β109Updated 2 months ago
- my commonly-used toolsβ63Updated 10 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.β27Updated 9 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoningβ69Updated 4 months ago
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)β44Updated 4 months ago
- β84Updated last year
- Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20β¦β12Updated last year
- β56Updated 4 months ago
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Modelsβ69Updated 6 months ago
- β59Updated last year