WebPAI / DCGen
☆19Updated last month
Alternatives and similar repositories for DCGen
Users who are interested in DCGen are comparing it to the libraries listed below.
- Under construction☆11Updated 6 months ago
- Multimodal Large Language Models for Code Generation under Multimodal Scenarios☆99Updated last week
- A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks☆12Updated 4 months ago
- CVPR25☆23Updated 2 weeks ago
- GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes fr…☆119Updated 8 months ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆91Updated 9 months ago
- Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"☆53Updated last month
- Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations b…☆28Updated last year
- The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"☆37Updated last year
- (ArXiv25) Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning☆52Updated last month
- Code for ACL (main) paper "JumpCoder: Go Beyond Autoregressive Coder via Online Modification"☆27Updated last year
- [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…☆60Updated last year
- ☆29Updated 9 months ago
- Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents☆145Updated 2 months ago
- [ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"☆81Updated last year
- ZeroGUI: Automating Online GUI Learning at Zero Human Cost☆75Updated 2 weeks ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge.☆75Updated 5 months ago
- [ICLR 2025] Dissecting adversarial robustness of multimodal language model agents☆97Updated 4 months ago
- Official repository for "Safety in Large Reasoning Models: A Survey" - Exploring safety risks, attacks, and defenses for Large Reasoning …☆60Updated last month
- GitHub page for "Large Language Model-Brained GUI Agents: A Survey"☆173Updated 3 weeks ago
- Benchmarking Large Language Models' Resistance to Malicious Code☆12Updated 7 months ago
- Official code repo for the paper "LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark"☆31Updated 2 months ago
- ☆13Updated last year
- [ICLR'25] Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training☆44Updated 5 months ago
- ☆31Updated 2 months ago
- A Systematic Literature Review on Large Language Models for Automated Program Repair☆194Updated 7 months ago
- Official Code for ACL 2024 paper "GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis"☆57Updated 8 months ago
- ☆21Updated 3 months ago
- Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"☆57Updated 4 months ago
- [ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration R…☆101Updated last week