hkust-nlp / GUIMidView external linksLinks
☆21May 3, 2025Updated 9 months ago
Alternatives and similar repositories for GUIMid
Users that are interested in GUIMid are comparing it to the libraries listed below
Sorting:
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆30Nov 24, 2024Updated last year
- Official Repo for "Why Settle for One? Text-to-ImageSet Generation and Evaluation"☆21Oct 1, 2025Updated 4 months ago
- ☆14Mar 20, 2025Updated 10 months ago
- Repo for Anonymous purpose, pls don't distribute☆10Oct 2, 2024Updated last year
- ☆47Oct 2, 2025Updated 4 months ago
- The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization☆21May 26, 2025Updated 8 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆13Jul 27, 2025Updated 6 months ago
- ☆21Jul 21, 2025Updated 6 months ago
- ☆19Jun 4, 2025Updated 8 months ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆26Oct 15, 2025Updated 4 months ago
- ☆23Jan 28, 2026Updated 2 weeks ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"☆28Jul 7, 2025Updated 7 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆17Oct 17, 2025Updated 3 months ago
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆36Nov 17, 2024Updated last year
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Jun 1, 2025Updated 8 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Oct 14, 2024Updated last year
- Code, benchmark and environment for "OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows"☆37Nov 10, 2025Updated 3 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated 11 months ago
- Official repo of "MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents". It can be used to evaluate a GUI agent w…☆99Sep 8, 2025Updated 5 months ago
- Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation☆19Mar 23, 2024Updated last year
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆31Dec 13, 2025Updated 2 months ago
- ☆122Oct 3, 2025Updated 4 months ago
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"☆47Jun 3, 2025Updated 8 months ago
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆28Jul 14, 2025Updated 7 months ago
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆34Aug 28, 2025Updated 5 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- ☆17Aug 1, 2025Updated 6 months ago
- Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges☆27May 14, 2025Updated 9 months ago
- ☆23May 25, 2023Updated 2 years ago
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆56Jun 1, 2025Updated 8 months ago
- [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆108May 18, 2025Updated 8 months ago
- OpenPI dataset for tracking entities in open domain procedural text☆24Aug 13, 2024Updated last year
- Official implemention for Diffusion Models Are Innate One-Step Generators☆26Jun 25, 2025Updated 7 months ago
- Code for paper "Analog Foundation Models"☆30Sep 18, 2025Updated 4 months ago
- ☆45May 27, 2025Updated 8 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆48Jan 8, 2026Updated last month
- Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"☆64Dec 4, 2025Updated 2 months ago
- Resa: Transparent Reasoning Models via SAEs☆47Sep 23, 2025Updated 4 months ago
- ☆24Jun 13, 2023Updated 2 years ago