caixd-220529 / LifelongAgentBench
Code repo for "LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners"
☆73 Updated 8 months ago
Alternatives and similar repositories for LifelongAgentBench
Users who are interested in LifelongAgentBench are comparing it to the repositories listed below
- ☆100 Updated 10 months ago
- ☆75 Updated last year
- ☆177 Updated last month
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet ☆238 Updated 2 months ago
- ☆186 Updated 3 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆142 Updated 11 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA. ☆83 Updated 2 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ ☆50 Updated last year
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents ☆133 Updated 10 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents ☆290 Updated 2 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models ☆156 Updated 7 months ago
- ☆45 Updated last month
- A research repo for experiments about Reinforcement Finetuning ☆53 Updated 9 months ago
- Reinforced Multi-LLM Agents training ☆69 Updated last week
- ☆223 Updated 3 weeks ago
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba… ☆15 Updated 5 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆312 Updated 3 weeks ago
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions. ☆52 Updated 5 months ago
- ☆43 Updated 5 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent." ☆103 Updated last year
- A Survey of Personalization: From RAG to Agent ☆97 Updated 5 months ago
- ☆25 Updated 9 months ago
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning ☆70 Updated 2 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆134 Updated 10 months ago
- ☆54 Updated 10 months ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large … ☆99 Updated last year
- ☆213 Updated 6 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla… ☆47 Updated 8 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning ☆71 Updated 8 months ago
- ☆51 Updated last year