MobileLLM / Personal_LLM_Agents_Survey
Paper list for Personal LLM Agents
☆345Updated 7 months ago
Alternatives and similar repositories for Personal_LLM_Agents_Survey:
Users who are interested in Personal_LLM_Agents_Survey are comparing it to the repositories listed below
- This is the repository for the Tool Learning survey.☆274Updated last month
- Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"☆282Updated 8 months ago
- Papers related to LLM agents published at top conferences☆306Updated 10 months ago
- ☆174Updated 3 weeks ago
- [ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding☆692Updated 3 weeks ago
- The model, data and code for the visual GUI Agent SeeClick☆248Updated 3 weeks ago
- Awesome LLM Plaza: daily tracking of all sorts of awesome LLM topics, e.g. LLMs for coding, robotics, reasoning, multimod, etc.☆166Updated this week
- ☆369Updated 2 months ago
- An Awesome Collection for LLM Survey☆313Updated 3 months ago
- ☆320Updated 6 months ago
- Related work and background techniques for OpenAI o1☆168Updated last month
- ☆141Updated 7 months ago
- Survey Paper List - Efficient LLM and Foundation Models☆229Updated 2 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆121Updated 3 months ago
- Official implementation of paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)☆291Updated 3 months ago
- A collection of 150+ surveys on LLMs☆218Updated 2 months ago
- GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well a…☆349Updated 8 months ago
- [ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.☆224Updated last month
- Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs…☆441Updated last month
- A Comprehensive Benchmark for Software Development.☆85Updated 6 months ago
- ☆115Updated last year
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e…☆366Updated 3 months ago
- A repository sharing the literature on long-context large language models, including methodologies and evaluation benchmarks☆254Updated 4 months ago
- Paper collection on building and evaluating language model agents via executable language grounding☆341Updated 7 months ago
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆186Updated 2 months ago
- ☆268Updated 3 weeks ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)☆206Updated 5 months ago
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future☆368Updated 5 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆397Updated 2 months ago
- AndroidWorld is an environment and benchmark for autonomous agents☆151Updated this week