hkust-nlp/GUIMid

☆22

Alternatives and similar repositories for GUIMid

Users who are interested in GUIMid are comparing it to the repositories listed below.

StarWalkin / UI-NEXUS
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14 · Jul 27, 2025 · Updated 11 months ago
chang-github-00 / Predictive-Decoding
Repo for Anonymous purpose, pls don't distribute
☆10 · Oct 2, 2024 · Updated last year
HeimingX / TAG
Official code for Attention-driven GUI Grounding, AAAI2025
☆16 · Dec 17, 2024 · Updated last year
hanxuhu / SeqIns
The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…
☆30 · Nov 24, 2024 · Updated last year
xufangzhi / Odyssey-Arena
Extremely Long-Horizon Agentic Tasks Requiring Active Acting and Inductive Reasoning
☆33 · Feb 9, 2026 · Updated 5 months ago
yayayacc / MUR
☆49 · May 14, 2026 · Updated 2 months ago
chengyou-jia / T2IS
Official Repo for "Why Settle for One? Text-to-ImageSet Generation and Evaluation"
☆21 · Oct 1, 2025 · Updated 9 months ago
yayayacc / TIDE
☆18 · Feb 4, 2026 · Updated 5 months ago
MJ-Bench / MJ-Bench
(NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"
☆51 · Jun 3, 2025 · Updated last year
wjn1996 / Chain-of-Knowledge
☆24 · Jun 13, 2023 · Updated 3 years ago
njucckevin / OpenMobile-Code
The model, data and code for OpenMobile
☆50 · Jul 9, 2026 · Updated 2 weeks ago
hkust-nlp / deepsearch-tts
Pushing Test-Time Scaling Limits of Deep Search with Asymmetric Verification
☆21 · Oct 8, 2025 · Updated 9 months ago
OS-Copilot / OS-Sentinel
[ACL 2026] Code, benchmark and environment for "OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic…
☆49 · Jul 5, 2026 · Updated 3 weeks ago
xinghaow99 / prism
[ICML 2026] Prism: Spectral-Aware Block-Sparse Attention
☆27 · May 22, 2026 · Updated 2 months ago
OS-Copilot / ScienceBoard
[ICLR 2026] Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"
☆132 · Feb 2, 2026 · Updated 5 months ago
njucckevin / CapArena
An Arena-style Automated Evaluation Benchmark for Detailed Captioning
☆59 · Jun 1, 2025 · Updated last year
which47 / LLMCL
Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning
☆38 · Nov 17, 2024 · Updated last year
showlab / macosworld
☆35 · Jan 28, 2026 · Updated 5 months ago
hanxuhu / chain-of-symbol-planning
☆23 · May 25, 2023 · Updated 3 years ago
HITsz-TMG / ICL-State-Vector
☆12 · Jul 4, 2024 · Updated 2 years ago
OpenGVLab / ZeroGUI
ZeroGUI: Automating Online GUI Learning at Zero Human Cost
☆119 · Jul 17, 2025 · Updated last year
LuLuLuyi / LongHeads
[EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor
☆32 · Apr 8, 2024 · Updated 2 years ago
chengyou-jia / ChatGen
[CVPR 2025] ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting
☆33 · Dec 5, 2024 · Updated last year
xcltql666 / DenseDiT
Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"
☆27 · Jun 7, 2026 · Updated last month
hkust-nlp / SynCSE
This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"
☆40 · Jun 9, 2023 · Updated 3 years ago
microsoft / GUI-Actor
[NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
☆410 · Apr 13, 2026 · Updated 3 months ago
X-GenGroup / PaCo-RL
Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*
☆42 · Dec 13, 2025 · Updated 7 months ago
cylnlp / convsumx
Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation
☆19 · Mar 23, 2024 · Updated 2 years ago
njucckevin / SeeClick
The model, data and code for the visual GUI Agent SeeClick
☆493 · Jul 13, 2025 · Updated last year
xufangzhi / Genius
[ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework
☆72 · Jun 1, 2025 · Updated last year
baopj / Vid-Morp
☆12 · Dec 6, 2024 · Updated last year
LuLuLuyi / R-HORIZON
[ICLR'2026] R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
☆18 · Oct 21, 2025 · Updated 9 months ago
ai-agents-2030 / ViMo
☆26 · Apr 2, 2026 · Updated 3 months ago
allenai / openpi-dataset
OpenPI dataset for tracking entities in open domain procedural text
☆24 · Aug 13, 2024 · Updated last year
ljang0 / videowebarena
☆14 · Dec 25, 2024 · Updated last year
CONE-MT / LLaMAX
☆75 · Dec 6, 2024 · Updated last year
hkust-nlp / model-task-align-rl
[ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".
☆18 · Feb 9, 2026 · Updated 5 months ago
OSU-NLP-Group / Explorer
[ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
☆29 · Feb 17, 2026 · Updated 5 months ago
InternLM / JanusCoder
[ICLR 2026] JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence
☆78 · May 9, 2026 · Updated 2 months ago