Prompt-R1: Collaborative Automatic Prompting Framework via End-to-end Reinforcement Learning
☆56Feb 24, 2026Updated 3 weeks ago
Alternatives and similar repositories for Prompt-R1
Users who are interested in Prompt-R1 are comparing it to the repositories listed below
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆28Jul 14, 2025Updated 8 months ago
- Source code for SWIFT, an efficient reward model.☆19Jan 13, 2026Updated 2 months ago
- A Multi-Agent Approach Integrating Socratic Guidance for Automated Prompt Optimization☆17Dec 15, 2025Updated 3 months ago
- [ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search".☆35Dec 6, 2025Updated 3 months ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆25Jan 5, 2026Updated 2 months ago
- Code and data for QueryAgent (ACL 2024)☆20Dec 19, 2024Updated last year
- The official implementation of "Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings"☆18Dec 5, 2024Updated last year
- ☆51Jan 31, 2026Updated last month
- A structurally comprehensive dataset of AMR-to-text alignments for coverage of a larger variety of linguistic phenomena, for research rel…☆16Dec 10, 2022Updated 3 years ago
- Tianchi Guangdong Industrial Intelligent Manufacturing Big Data Innovation Algorithm Competition: aluminum profile surface defect detection☆22Oct 28, 2018Updated 7 years ago
- Open-source code for the paper “MEMRL: SELF-EVOLVING AGENTS VIA RUNTIME REINFORCEMENT LEARNING ON EPISODIC MEMORY”☆55Feb 27, 2026Updated 3 weeks ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆54Dec 13, 2025Updated 3 months ago
- ☆41Dec 15, 2025Updated 3 months ago
- Code for the ACL 2023 paper: Knowledgeable Parameter Efficient Tuning Network for Commonsense Question Answering.☆11Sep 23, 2023Updated 2 years ago
- Self evolve extension for openclaw. Let your claw grow continuously.☆71Mar 14, 2026Updated last week
- Model and datasets for schema matching☆14Jul 17, 2021Updated 4 years ago
- Code-Style In-Context Learning for Knowledge-Based Question Answering☆14Mar 3, 2024Updated 2 years ago
- ☆33Jul 15, 2025Updated 8 months ago
- [ICLR 2026] "VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?", Yuanxin Liu, Kun Ouyang, Haoning Wu, Yi Liu, L…☆37Jan 30, 2026Updated last month
- [Paper][CCKS2023] CausE: Towards Causal Knowledge Graph Embedding☆17Jul 30, 2023Updated 2 years ago
- Heatmap-based Out-of-Distribution Detection (WACV 2023)☆13Mar 27, 2024Updated last year
- ☆12Nov 22, 2022Updated 3 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- The code for the CIKM 2023 short paper: Bridging the KB-Text Gap: Leveraging Structured Knowledge-aware Pre-training for KBQA☆19Jul 19, 2024Updated last year
- Accepted by ACL 2025☆30Aug 13, 2025Updated 7 months ago
- Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"☆68Dec 18, 2025Updated 3 months ago
- ☆13Feb 28, 2025Updated last year
- ☆12Sep 28, 2024Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- This is the official code for the paper "SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation".☆60Sep 27, 2024Updated last year
- ☆15Mar 6, 2025Updated last year
- [CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆74Feb 27, 2026Updated 3 weeks ago
- CATArena is an engineering-level tournament evaluation platform for Large Language Model-driven code agents (LLM-driven code agents), bas…☆60Dec 25, 2025Updated 2 months ago
- RL with Experience Replay☆55Jul 27, 2025Updated 7 months ago
- ☆24Jun 13, 2017Updated 8 years ago
- ☆81Oct 1, 2025Updated 5 months ago
- ☆22Jul 2, 2025Updated 8 months ago
- LLM tools for running queries against SQLite☆49May 27, 2025Updated 9 months ago
- A challenging aggregation benchmark for long-context models☆41Feb 22, 2026Updated last month