caixd-220529 / LifelongAgentBench
Code repo for "LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners"
☆54 Updated 6 months ago
Alternatives and similar repositories for LifelongAgentBench
Users who are interested in LifelongAgentBench are comparing it to the repositories listed below.
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ ☆49 Updated last year
- The latest progress of Personalized Large Language Models (LLMs). ☆29 Updated last month
- ☆66 Updated 11 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent." ☆93 Updated last year
- A Survey of Personalization: From RAG to Agent ☆87 Updated 3 months ago
- ☆117 Updated 2 weeks ago
- ☆25 Updated 7 months ago
- ☆135 Updated 6 months ago
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions. ☆50 Updated 3 months ago
- ☆289 Updated 4 months ago
- ☆91 Updated 8 months ago
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving (EMNLP 2024 Findings)" ☆34 Updated last year
- Reinforced Multi-LLM Agents training ☆59 Updated 5 months ago
- A research repo for experiments about Reinforcement Finetuning ☆52 Updated 7 months ago
- [ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing ☆36 Updated last year
- ☆28 Updated last year
- Accepted LLM Papers in NeurIPS 2024 ☆37 Updated last year
- [ACL 2023] Preserving Commonsense Knowledge from Pre-trained Language Models via Causal Inference ☆24 Updated last year
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language … ☆143 Updated 6 months ago
- Official implementation for "ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation" ☆21 Updated 3 months ago
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data ☆14 Updated 2 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution" ☆54 Updated 2 months ago
- ☆168 Updated last month
- [ICML 2025] Official resources of "KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search". ☆33 Updated 3 months ago
- VeriGUI: Verifiable Long-Chain GUI Dataset ☆82 Updated last month
- AdaICL: Which Examples to Annotate of In-Context Learning? Towards Effective and Efficient Selection ☆18 Updated 2 years ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents ☆196 Updated 3 weeks ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA. ☆79 Updated 3 weeks ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey" ☆148 Updated last year
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models" ☆23 Updated 6 months ago