THUDM / AutoWebGLM
An LLM-based Web Navigating Agent (KDD'24)
β828Updated 5 months ago
Alternatives and similar repositories for AutoWebGLM:
Users that are interested in AutoWebGLM are comparing it to the libraries listed below
- An open-sourced end-to-end VLM-based GUI Agentβ837Updated last month
- An LLM-based Agent for the New Automation Paradigm - Agentic Process Automationβ830Updated last year
- π WebWalker: Benchmarking LLMs in Web Traversalβ376Updated this week
- A generalized information-seeking agent system with Large Language Models (LLMs).β1,138Updated 9 months ago
- A python native agent frameworkβ447Updated 3 months ago
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QAβ480Updated 2 months ago
- A lightweight framework for building LLM-based agentsβ2,076Updated last week
- π¦οΈ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/β313Updated 4 months ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMsβ1,402Updated last year
- An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through inβ¦β684Updated 5 months ago
- π€ Awesome list of AGI Agents. Agents η²Ύιθ΅ζΊει.β361Updated last year
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multβ¦β731Updated last month
- Build & Optimize your RAG.β564Updated last week
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi eβ¦β422Updated last week
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agentβ274Updated this week
- A LLM-based Agent that predict its tasks proactively.β332Updated this week
- β930Updated last month
- The model, data and code for the visual GUI Agent SeeClickβ339Updated 4 months ago
- Easy, fast, and cheap pretrain,finetune, serving for everyoneβ291Updated this week
- [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.β1,117Updated last week
- π» A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.β572Updated 2 weeks ago
- ScreenAgent: A Computer Control Agent Driven by Visual Language Large Model (IJCAI-24)β420Updated 3 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RLβ330Updated last month
- [GenAI Application Development Framework] π Build GenAI application quick and easy π¬ Easy to interact with GenAI agent in code using sβ¦β1,270Updated this week
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarkingβ647Updated this week
- OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinkingβ421Updated last week
- Aguvis: Unified Pure Vision Agents for Autonomous GUI Interactionβ265Updated 2 weeks ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"β804Updated this week