NoviScl / Design2Code
☆468Updated 2 months ago
Alternatives and similar repositories for Design2Code:
Users who are interested in Design2Code are comparing it to the repositories listed below.
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆697Updated 2 weeks ago
- Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"☆316Updated this week
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆561Updated 8 months ago
- An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through in…☆639Updated 3 months ago
- 🤠 Agent-as-a-Judge and DevAI dataset☆313Updated last week
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"☆766Updated 6 months ago
- ControlLLM: Augment Language Models with Tools by Searching on Graphs☆188Updated 6 months ago
- ☆247Updated last year
- AI for all: Build the large graph of the language models☆252Updated 7 months ago
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI☆1,016Updated last month
- ☆293Updated 10 months ago
- This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reaso…☆347Updated last year
- An open-source end-to-end VLM-based GUI Agent☆637Updated this week
- [ICLR 2025] The First Multimodal Search Engine Pipeline and Benchmark for LMMs☆409Updated last week
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆332Updated 7 months ago
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs☆506Updated last year
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e…☆384Updated last month
- A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.☆176Updated 4 months ago
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,125Updated 7 months ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆583Updated 3 weeks ago
- An LLM-based Web Navigating Agent (KDD'24)☆801Updated 4 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆257Updated 2 weeks ago
- Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆185Updated 2 weeks ago
- [IJCAI 2024] Generate different roles for GPTs to form a collaborative entity for complex tasks.☆1,265Updated 9 months ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,384Updated last year
- Set-of-Mark Prompting for GPT-4V and LMMs☆1,258Updated 5 months ago
- OpenResearcher, an advanced Scientific Research Assistant☆420Updated 3 months ago
- HPT - Open Multimodal LLMs from HyperGAI☆313Updated 7 months ago
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills☆721Updated 11 months ago
- Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration"☆319Updated 8 months ago