tml1026 / Lifelong-Personalized-AgentLinks
☆15Updated 3 months ago
Alternatives and similar repositories for Lifelong-Personalized-Agent
Users that are interested in Lifelong-Personalized-Agent are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"☆47Updated 5 months ago
- ☆50Updated 5 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆133Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆109Updated 5 months ago
- ☆103Updated last month
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning☆21Updated 3 months ago
- [TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"☆89Updated last month
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆248Updated 6 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆83Updated 7 months ago
- Official code repository for Sketch-of-Thought (SoT)☆129Updated 6 months ago
- [COLM 2025] Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale☆74Updated this week
- A-MEM: Agentic Memory for LLM Agents☆151Updated this week
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆64Updated 5 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆178Updated 4 months ago
- Test-time preferenece optimization (ICML 2025).☆169Updated 6 months ago
- Efficient Agent Training for Computer Use☆132Updated 2 months ago
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant☆41Updated 10 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆51Updated 3 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆198Updated last week
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆110Updated 3 months ago
- [ACL 2025] Knowledge Unlearning for Large Language Models☆46Updated last month
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆68Updated 8 months ago
- Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"☆172Updated 2 weeks ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last month
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…☆21Updated 11 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆146Updated 4 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆166Updated last month
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆129Updated 7 months ago
- ☆50Updated last year
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆104Updated last week