ruocwang / mixture-of-promptsLinks
[ICML 2024] One Prompt is Not Enough: Automated Construction of a Mixture-of-Expert Prompts - TurningPoint AI
☆31Updated last year
Alternatives and similar repositories for mixture-of-prompts
Users that are interested in mixture-of-prompts are comparing it to the libraries listed below
Sorting:
- ☆213Updated 7 months ago
- ☆138Updated 10 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆113Updated 5 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆86Updated 9 months ago
- Discriminative Constrained Optimization for Reinforcing Large Reasoning Models☆49Updated 2 months ago
- ☆52Updated last year
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆98Updated last year
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆254Updated 8 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆65Updated 11 months ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"☆140Updated last year
- ☆220Updated 9 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆69Updated 10 months ago
- A Sober Look at Language Model Reasoning☆92Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆134Updated 9 months ago
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]☆71Updated 11 months ago
- ☆346Updated 5 months ago
- ☆108Updated last year
- ☆50Updated 11 months ago
- ☆226Updated 10 months ago
- ☆143Updated 4 months ago
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆150Updated 2 months ago
- ☆117Updated 11 months ago
- ☆104Updated 7 months ago
- ☆107Updated last month
- [ACL 2025] Knowledge Unlearning for Large Language Models☆47Updated 3 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated last year
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆156Updated 6 months ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆20Updated 2 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆34Updated last year
- RL Scaling and Test-Time Scaling (ICML'25)☆112Updated 11 months ago