MozerWang / promISeLinks
[COLING 2024 (Oral)] PromISe:Releasing the Capabilities of LLMs with Prompt Introspective Search
β23Updated last year
Alternatives and similar repositories for promISe
Users that are interested in promISe are comparing it to the libraries listed below
Sorting:
- (ACL 2025) π₯π₯π₯Code for "Empowering Multimodal Large Language Models with Evol-Instruct"β17Updated 3 months ago
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agentsβ42Updated 2 months ago
- Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"β22Updated 9 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.β25Updated 6 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.β81Updated 6 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language β¦β35Updated 7 months ago
- The reinforcement learning codes for dataset SPA-VLβ37Updated last year
- β46Updated 4 months ago
- β18Updated 5 months ago
- More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Modelsβ48Updated 3 months ago
- [ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Modelsβ56Updated last year
- β43Updated last week
- Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shapingβ52Updated 3 months ago
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"β50Updated 3 months ago
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-ofβ¦β43Updated 3 months ago
- my commonly-used toolsβ61Updated 7 months ago
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"β33Updated last year
- β81Updated last year
- β34Updated 11 months ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"β29Updated 4 months ago
- A comprehensive collection of process reward models.β104Updated last month
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.β79Updated 9 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Modelsβ59Updated 8 months ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"β26Updated 3 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".β128Updated 10 months ago
- Code of paper 'UltraIF: Advancing Instruction Following from the Wild'.β17Updated 4 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuningβ84Updated 6 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.β153Updated 2 weeks ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoningβ65Updated last month
- code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"β32Updated 6 months ago