caixd-220529 / LifelongAgentBenchLinks
Code repo for "LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners"
☆41Updated last month
Alternatives and similar repositories for LifelongAgentBench
Users that are interested in LifelongAgentBench are comparing it to the libraries listed below
Sorting:
- A Survey of Personalization: From RAG to Agent☆54Updated this week
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"☆95Updated last week
- ☆85Updated last month
- ☆47Updated 5 months ago
- 在verl上做reward的定制开发☆75Updated last month
- This repository collects awesome survey, resource, and paper for Lifelong Learning for Large Language Models. (Updated Regularly)☆57Updated last month
- A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning☆35Updated last month
- Yelp Simulator for WWW'25 AgentSociety Challenge☆81Updated 2 months ago
- Awesome Agent Training☆188Updated last week
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆33Updated 10 months ago
- ☆144Updated 10 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆124Updated last week
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆201Updated last week
- ☆28Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆45Updated 8 months ago
- ☆242Updated last week
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆77Updated 8 months ago
- Paper List for In-context Learning 🌷☆183Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆127Updated 9 months ago
- ☆38Updated 3 months ago
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"☆49Updated 2 months ago
- Latest Advances on Long Chain-of-Thought Reasoning☆432Updated last week
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆145Updated 6 months ago
- The latest progress of Personalized Large Language Models (LLMs).☆22Updated last month
- ☆94Updated 4 months ago
- An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base.☆219Updated last week
- Neural Code Intelligence Survey 2024; Reading lists and resources☆265Updated 3 weeks ago
- CycleResearcher: Improving Automated Research via Automated Review☆210Updated last week
- This is the repo for the survey of Bias and Fairness in IR with LLMs.☆54Updated 3 months ago
- SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for…☆79Updated 7 months ago