THUDM / AutoWebGLM
An LLM-based Web Navigating Agent (KDD'24)
☆881Updated 11 months ago
Alternatives and similar repositories for AutoWebGLM
Users who are interested in AutoWebGLM are comparing it to the libraries listed below.
- An open-sourced end-to-end VLM-based GUI Agent☆1,049Updated 5 months ago
- An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation☆852Updated last year
- An LLM-based agent that predicts its tasks proactively.☆417Updated 3 weeks ago
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,183Updated last year
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆503Updated 8 months ago
- ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)☆504Updated 9 months ago
- An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through in…☆752Updated 10 months ago
- A python native agent framework☆458Updated 9 months ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,464Updated last year
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆280Updated 3 months ago
- Set up AI2Apps on your local system so you can use your own OpenAI key or build more back-end features.☆433Updated last week
- ReMe: Memory Management Framework for Agents - Remember Me, Refine Me.☆529Updated this week
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆868Updated 5 months ago
- ☆232Updated last year
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,445Updated 3 months ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆782Updated 7 months ago
- ☆252Updated last year
- Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"☆364Updated 3 weeks ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆373Updated 4 months ago
- 🦀️ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/☆372Updated 2 months ago
- 🤖 Awesome list of AGI Agents. A curated collection of agent resources.☆468Updated last year
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆450Updated 3 months ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆354Updated 6 months ago
- A lightweight framework for building LLM-based agents☆2,189Updated last month
- ☆75Updated 3 weeks ago
- ☆962Updated 7 months ago
- Enhance LLM agents with rich tool APIs☆398Updated last year
- Easy, fast, and cheap pretraining, finetuning, and serving for everyone☆315Updated last month
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆158Updated 5 months ago
- Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)☆653Updated last year