NoviScl / Design2CodeLinks
☆547Updated last year
Alternatives and similar repositories for Design2Code
Users that are interested in Design2Code are comparing it to the libraries listed below
Sorting:
- An LLM-based Web Navigating Agent (KDD'24)☆895Updated last year
- ☆251Updated last year
- Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"☆370Updated 3 weeks ago
- A repo with an automated prompt engineering workflow from scratch. It leverages the OPRO technique.☆201Updated last year
- AI for all: Build the large graph of the language models☆277Updated last year
- [ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large mult…☆802Updated 9 months ago
- [ICLR 2025] The First Multimodal Seach Engine Pipeline and Benchmark for LMMs☆480Updated 9 months ago
- ☆66Updated last year
- [NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist w…☆895Updated 2 weeks ago
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆1,449Updated last year
- AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI☆1,059Updated 11 months ago
- LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.☆163Updated 5 months ago
- An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation☆853Updated last year
- This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reaso…☆367Updated last year
- 🦀️ CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/☆381Updated 4 months ago
- An open-sourced end-to-end VLM-based GUI Agent☆1,085Updated 7 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆220Updated 5 months ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction☆370Updated 8 months ago
- 🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.☆264Updated 9 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆344Updated last year
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs☆510Updated 2 years ago
- 👩⚖️ Coding Agent-as-a-Judge☆667Updated 6 months ago
- ☆132Updated last year
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆669Updated 4 months ago
- ☆250Updated last year
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆512Updated 10 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆242Updated last year
- [NeurlPS D&B 2024] Generative AI for Math: MathPile☆418Updated 7 months ago
- HPT - Open Multimodal LLMs from HyperGAI☆315Updated last year
- ☆313Updated last year