THUDM / AutoWebGLMLinks
An LLM-based Web Navigating Agent (KDD'24)
☆859Updated 8 months ago
Alternatives and similar repositories for AutoWebGLM
Users that are interested in AutoWebGLM are comparing it to the libraries listed below
Sorting:
- An open-sourced end-to-end VLM-based GUI Agent☆956Updated last month
- An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation☆845Updated last year
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,438Updated last year
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆495Updated 5 months ago
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,160Updated 11 months ago
- 🌐 WebWalker [ACL2025] & WebDancer [Preprint]☆421Updated this week
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆329Updated last month
- Enhance LLM agents with rich tool APIs☆387Updated 8 months ago
- An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through in…☆719Updated 7 months ago
- The model, data and code for the visual GUI Agent SeeClick☆378Updated 6 months ago
- ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)☆455Updated 6 months ago
- A LLM-based Agent that predict its tasks proactively.☆367Updated last week
- A lightweight framework for building LLM-based agents☆2,134Updated 2 months ago
- 🦀️ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/☆344Updated this week
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆306Updated 2 months ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆748Updated 4 months ago
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e…☆474Updated 2 months ago
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.☆1,261Updated last week
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆387Updated last month
- ☆248Updated last year
- ☆939Updated 3 months ago
- 🩹Editing large language models within 10 seconds⚡☆1,327Updated last year
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆347Updated last year
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆142Updated 2 months ago
- Using Groq or OpenAI or Ollama to create o1-like reasoning chains☆295Updated 8 months ago
- ☆320Updated 11 months ago
- Setup AI2Apps at local system so you can use your own OpenAI key or make more back-end features.☆423Updated last week
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI☆1,039Updated 5 months ago
- A python native agent framework☆453Updated 6 months ago
- ☆478Updated 3 months ago