☆22May 3, 2025Updated last year
Alternatives and similar repositories for GUIMid
Users that are interested in GUIMid are comparing it to the libraries listed below.
- ☆49May 14, 2026Updated last month
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios"☆27Jun 7, 2026Updated last week
- (NeurIPS 2025) Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"☆51Jun 3, 2025Updated last year
- ☆24Jun 13, 2023Updated 3 years ago
- Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"☆65Dec 4, 2025Updated 6 months ago
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆37Nov 17, 2024Updated last year
- ☆23May 25, 2023Updated 3 years ago
- ☆12Jul 4, 2024Updated last year
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆57Oct 16, 2025Updated 8 months ago
- ☆130Oct 3, 2025Updated 8 months ago
- [EMNLP'24] LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆32Apr 8, 2024Updated 2 years ago
- This is the official implementation of the paper: "Contrastive Learning of Sentence Embeddings from Scratch"☆40Jun 9, 2023Updated 3 years ago
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆40Dec 13, 2025Updated 6 months ago
- Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation☆19Mar 23, 2024Updated 2 years ago
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- The model, data and code for the visual GUI Agent SeeClick☆483Jul 13, 2025Updated 11 months ago
- The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization☆21May 26, 2025Updated last year
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆247May 5, 2025Updated last year
- Code for Research Project TLDR☆25Jul 28, 2025Updated 10 months ago
- ☆14Dec 25, 2024Updated last year
- ☆75Dec 6, 2024Updated last year
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆20Mar 21, 2025Updated last year
- OpenPI dataset for tracking entities in open domain procedural text☆24Aug 13, 2024Updated last year
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆29Feb 25, 2025Updated last year
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training☆14Oct 25, 2024Updated last year
- [EMNLP 2024 Findings] SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback wit…☆89Jan 18, 2026Updated 4 months ago
- Official implementation for Diffusion Models Are Innate One-Step Generators☆26Jun 25, 2025Updated 11 months ago
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆452Apr 20, 2025Updated last year
- [NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆408Apr 13, 2026Updated 2 months ago
- The second Homework of NLP☆13Jun 9, 2021Updated 5 years ago
- ☆11Oct 2, 2024Updated last year
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents☆29Feb 17, 2026Updated 3 months ago
- ☆36Mar 10, 2025Updated last year
- [Pattern Recognition] The implementation of MoCA☆12Apr 1, 2023Updated 3 years ago
- ☆15Mar 20, 2025Updated last year
- ☆13Apr 5, 2026Updated 2 months ago
- [ICML 2025] Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment (https://arxiv.org/abs/2410.02197)☆42Sep 8, 2025Updated 9 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 9 months ago