[CVPR 2025] Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents
★32 · Jun 3, 2025 · Updated 10 months ago
Alternatives and similar repositories for monday
Users who are interested in monday are comparing it to the libraries listed below.
- Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation" ★22 · Sep 5, 2023 · Updated 2 years ago
- A simple visual test-time scaling method for GUI agent grounding ★23 · Dec 7, 2025 · Updated 4 months ago
- [AAAI 2026] Release of code, datasets and model for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for General… ★88 · Dec 1, 2025 · Updated 4 months ago
- How Robust are Fact Checking Systems on Colloquial Claims? In NAACL-HLT, 2021. ★23 · Jul 1, 2021 · Updated 4 years ago
- ★24 · Oct 9, 2023 · Updated 2 years ago
- Debiasing Through Data Attribution ★13 · May 23, 2024 · Updated last year
- ★14 · Dec 25, 2024 · Updated last year
- [ICLR 2025] On Evaluating the Durability of Safeguards for Open-Weight LLMs ★13 · Jun 20, 2025 · Updated 9 months ago
- Official Repository for our ICCV2021 paper: Continual Learning on Noisy Data Streams via Self-Purified Replay ★31 · Jan 5, 2022 · Updated 4 years ago
- ★13 · Jan 30, 2021 · Updated 5 years ago
- Flexibly track outputs and grad-outputs of torch.nn.Module. ★13 · Oct 6, 2023 · Updated 2 years ago
- ★18 · Nov 10, 2024 · Updated last year
- ★10 · Jul 6, 2023 · Updated 2 years ago
- Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase… ★13 · Jun 24, 2024 · Updated last year
- ★10 · Dec 3, 2021 · Updated 4 years ago
- Comprehensive benchmark for video text understanding ★28 · Jun 4, 2025 · Updated 10 months ago
- ★10 · Feb 13, 2023 · Updated 3 years ago
- [NeurIPS'24] GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching ★31 · May 29, 2025 · Updated 10 months ago
- ★24 · Jun 22, 2025 · Updated 9 months ago
- Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games" ★25 · Dec 14, 2025 · Updated 4 months ago
- ★18 · Sep 25, 2025 · Updated 6 months ago
- ★16 · Jan 4, 2022 · Updated 4 years ago
- [CVPR2023] Practical Network Acceleration with Tiny Sets ★14 · Jul 28, 2023 · Updated 2 years ago
- Official code for ICML 2024 paper "Learning to Continually Learn with the Bayesian Principle" ★20 · May 27, 2024 · Updated last year
- The source code of ExFunTube ★10 · Aug 8, 2025 · Updated 8 months ago
- A Universal Platform for Training and Evaluation of Mobile Interaction ★61 · Sep 24, 2025 · Updated 6 months ago
- Command-line tool for downloading and extending the RedCaps dataset. ★49 · Dec 18, 2023 · Updated 2 years ago
- [ACCV 2024] Simple, Easy 3D Object Detection with Point-Wise Semantics ★15 · Oct 28, 2025 · Updated 5 months ago
- A highly customizable statusline for Vim ★11 · Dec 20, 2016 · Updated 9 years ago
- PyTorch implementation of Centered Kernel Alignment (CKA) and its minibatch version. ★11 · May 11, 2022 · Updated 3 years ago
- Official Repository for our CVPR2024 paper: ESR-NeRF: Emissive Source Reconstruction Using LDR Multi-view Images ★15 · Jun 13, 2024 · Updated last year
- [ICLR24] AutoVP: An Automated Visual Prompting Framework and Benchmark ★23 · Sep 18, 2025 · Updated 6 months ago
- Automatic, unsupervised collection of web agent training data via exploration. ★24 · Oct 8, 2025 · Updated 6 months ago
- ★12 · Jan 10, 2025 · Updated last year
- A digital twin of the city of Chicago along with automated sensors ★13 · Nov 14, 2019 · Updated 6 years ago
- FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions ★55 · Apr 17, 2024 · Updated last year
- ACAV100M: Automatic Curation of Large-Scale Datasets for Audio-Visual Video Representation Learning. In ICCV, 2021. ★64 · Nov 18, 2021 · Updated 4 years ago
- A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents. ★1,163 · Aug 17, 2025 · Updated 7 months ago
- ★18 · Jun 10, 2024 · Updated last year