wangzx1219 / AgentDropoutLinks
☆33Updated 2 months ago
Alternatives and similar repositories for AgentDropout
Users that are interested in AgentDropout are comparing it to the libraries listed below
Sorting:
- ☆66Updated 3 months ago
- ☆39Updated 6 months ago
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.☆39Updated last month
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆54Updated 6 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆124Updated 3 months ago
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆72Updated last week
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆97Updated 2 weeks ago
- ☆67Updated 3 weeks ago
- ☆62Updated last week
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆125Updated 9 months ago
- ☆24Updated 2 months ago
- Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors☆25Updated last month
- Accepted LLM Papers in NeurIPS 2024☆37Updated 8 months ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆69Updated last month
- MARFT stands for Multi-Agent Reinforcement Fine-Tuning. This repository implements an LLM-based multi-agent reinforcement fine-tuning fra…☆43Updated last week
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆70Updated 7 months ago
- A research repo for experiments about Reinforcement Finetuning☆48Updated 2 months ago
- ☆64Updated 3 weeks ago
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆33Updated 9 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆45Updated last month
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆136Updated last week
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆65Updated 2 months ago
- ☆59Updated 6 months ago
- ☆37Updated this week
- This is a unified platform for implementing and evaluating test-time reasoning mechanisms in Large Language Models (LLMs).☆19Updated 5 months ago
- ☆30Updated last month
- [ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"☆25Updated last week
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆75Updated 3 weeks ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆59Updated last year
- A Survey on Large Language Model based Human-Agent Systems | Human-Agent Collaboration | Human-AI Collaboration☆69Updated this week