caixd-220529 / LifelongAgentBenchLinks
Code repo for "LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners"
☆56Updated 6 months ago
Alternatives and similar repositories for LifelongAgentBench
Users that are interested in LifelongAgentBench are comparing it to the libraries listed below
Sorting:
- ☆70Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆50Updated last year
- ☆146Updated 6 months ago
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆34Updated last year
- ☆123Updated last week
- The latest progress of Personalized Large Language Models (LLMs).☆33Updated last month
- [ACL 2025 Main] (🏆 Outstanding Paper Award) Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Proba…☆14Updated 4 months ago
- ☆28Updated last year
- A Survey of Personalization: From RAG to Agent☆91Updated 4 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆97Updated last year
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆156Updated 7 months ago
- A research repo for experiments about Reinforcement Finetuning☆53Updated 8 months ago
- ☆94Updated 8 months ago
- [arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents☆46Updated 5 months ago
- [NeurIPS 2024] Official implementation for paper "Can Graph Learning Improve Planning in LLM-based Agents?"☆147Updated 7 months ago
- [NeurIPS 2024] GITA: Graph to Image-Text Integration for Vision-Language Graph Reasoning☆52Updated 3 weeks ago
- ☆292Updated 5 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆80Updated last month
- ☆21Updated 3 months ago
- [ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing☆36Updated last year
- This is the repo for the survey of Bias and Fairness in IR with LLMs.☆59Updated 3 months ago
- ☆79Updated last week
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data☆17Updated 3 months ago
- ☆24Updated 8 months ago
- Accepted LLM Papers in NeurIPS 2024☆37Updated last year
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆21Updated 5 months ago
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.☆52Updated 4 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆150Updated last year
- ☆47Updated 10 months ago
- Reinforced Multi-LLM Agents training☆60Updated 6 months ago