NoviScl / Design2CodeLinks
β517Updated 8 months ago
Alternatives and similar repositories for Design2Code
Users that are interested in Design2Code are comparing it to the libraries listed below
Sorting:
- An LLM-based Web Navigating Agent (KDD'24)β870Updated 9 months ago
- π¦οΈ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/β352Updated last week
- [ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMsβ450Updated 5 months ago
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist wβ¦β841Updated 3 months ago
- Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"β351Updated 3 months ago
- LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.β163Updated last month
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multβ¦β760Updated 5 months ago
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UIβ1,045Updated 7 months ago
- β249Updated last year
- A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.β198Updated 10 months ago
- β303Updated last year
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhanβ¦β1,289Updated last year
- AI for all: Build the large graph of the language modelsβ270Updated last year
- An open-sourced end-to-end VLM-based GUI Agentβ992Updated 3 months ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interactionβ332Updated 4 months ago
- This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoβ¦β364Updated last year
- β61Updated 11 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agentsβ210Updated last month
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and β¦β345Updated last year
- Code for our ACL 2023 Paper "Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models".β675Updated 2 years ago
- An LLM-based Agent for the New Automation Paradigm - Agentic Process Automationβ851Updated last year
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Modelsβ646Updated 2 weeks ago
- A generalized information-seeking agent system with Large Language Models (LLMs).β1,170Updated last year
- A LLM-based Agent that predict its tasks proactively.β389Updated last month
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMsβ510Updated last year
- [NeurlPS D&B 2024] Generative AI for Math: MathPileβ414Updated 3 months ago
- HPT - Open Multimodal LLMs from HyperGAIβ316Updated last year
- We release a general framework for prompting LLMs to manipulate software in a closed-loop manner.β137Updated 10 months ago
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QAβ501Updated 6 months ago
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffoldingβ393Updated last year