wangzx1219 / AgentDropoutLinks
☆38Updated 3 months ago
Alternatives and similar repositories for AgentDropout
Users that are interested in AgentDropout are comparing it to the libraries listed below
Sorting:
- ☆70Updated 3 months ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆130Updated last month
- ☆46Updated 7 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆68Updated 6 months ago
- ☆26Updated 3 months ago
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆76Updated last month
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆67Updated 2 months ago
- Accepted LLM Papers in NeurIPS 2024☆37Updated 9 months ago
- ☆85Updated last month
- A research repo for experiments about Reinforcement Finetuning☆49Updated 3 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆154Updated last year
- ☆64Updated 3 weeks ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆85Updated 4 months ago
- Implementation of the MATRIX framework (ICML 2024)☆56Updated last year
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆79Updated 2 months ago
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'☆27Updated last month
- 📜 Paper list on decoding methods for LLMs and LVLMs☆52Updated 2 weeks ago
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"☆96Updated 2 weeks ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆127Updated 9 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆134Updated this week
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆282Updated last week
- Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation"☆19Updated 2 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆124Updated 3 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆45Updated 2 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆79Updated last month
- ☆33Updated 9 months ago
- ☆53Updated this week
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆63Updated 3 months ago
- ☆63Updated last week
- ☆51Updated last month