OSU-NLP-Group / SeeActView on GitHub
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
826Feb 3, 2025Updated last year

Alternatives and similar repositories for SeeAct

Users that are interested in SeeAct are comparing it to the libraries listed below

Sorting:

Are these results useful?