NoviScl / Design2CodeLinks
☆556Updated last year
Alternatives and similar repositories for Design2Code
Users that are interested in Design2Code are comparing it to the libraries listed below
Sorting:
- An LLM-based Web Navigating Agent (KDD'24)☆918Updated last year
- Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"☆371Updated 2 months ago
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI☆1,065Updated last year
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆1,521Updated last year
- An open-sourced end-to-end VLM-based GUI Agent☆1,113Updated 9 months ago
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆811Updated 11 months ago
- LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.☆166Updated 7 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆929Updated 2 months ago
- A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.☆203Updated last year
- ☆252Updated 2 years ago
- [ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs☆482Updated 11 months ago
- 🦀️ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/☆388Updated last week
- An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation☆858Updated 2 years ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆376Updated 10 months ago
- Web-Bench is a benchmark designed to evaluate the performance of LLMs in actual Web development.☆247Updated last month
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"☆1,279Updated last month
- An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through in…☆790Updated 3 months ago
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs☆511Updated 2 years ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆219Updated 6 months ago
- 👩⚖️ Agent-as-a-Judge: The Magic for Open-Endedness☆703Updated 7 months ago
- Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.☆807Updated 8 months ago
- A generalized information-seeking agent system with Large Language Models (LLMs).☆1,196Updated last year
- ☆316Updated last year
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆674Updated 6 months ago
- 🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.☆265Updated 10 months ago
- Convert any web design screenshot to clean HTML/CSS code☆662Updated 2 weeks ago
- A LLM-based Agent that predict its tasks proactively.☆466Updated 4 months ago
- This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reaso…☆367Updated 2 years ago
- [NeurlPS D&B 2024] Generative AI for Math: MathPile☆419Updated 9 months ago
- CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative H…☆322Updated last year