Repository of GUI Action Narrator
☆13Apr 8, 2025Updated 11 months ago
Alternatives and similar repositories for GUI-Narrator
Users that are interested in GUI-Narrator are comparing it to the libraries listed below
Sorting:
- Edit and Generate Anything in 3D world!☆14Apr 15, 2023Updated 2 years ago
- Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.☆116Jul 27, 2025Updated 7 months ago
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆43Mar 11, 2025Updated last year
- ☆30Apr 16, 2024Updated last year
- [ECCV 2022] GEB+: A Benchmark for Generic Event Boundary Captioning, Grounding and Retrieval☆17Aug 24, 2022Updated 3 years ago
- [NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos☆51Feb 22, 2026Updated 3 weeks ago
- Image classification done with Mindspore technology☆12Jan 24, 2021Updated 5 years ago
- ☆35Jun 20, 2024Updated last year
- The official repository of the OpenToM dataset☆29Feb 2, 2025Updated last year
- Multiscale 3D Convolutional Network☆15Dec 5, 2021Updated 4 years ago
- The code for the DHViT☆12Mar 6, 2022Updated 4 years ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- DSHFNet: Dynamic Scale Hierarchical Fusion Network Based on Multiattention for Hyperspectral Image and LiDAR Data Classification.☆20Sep 18, 2023Updated 2 years ago
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆39Feb 24, 2025Updated last year
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- Condense source code for LLM analysis by extracting essential highlights, utilizing a simplified version of Paul Gauthier's repomap techn…☆14Mar 3, 2024Updated 2 years ago
- DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles☆32Mar 8, 2026Updated last week
- ☆31Jul 3, 2025Updated 8 months ago
- [ICLR 2026] Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing☆27Jan 27, 2026Updated last month
- Manage Workflows with optional Scheduler or Event Arc triggers☆21Feb 24, 2026Updated 3 weeks ago
- this is based on the paper Chain-of-Retrieval Augmented Generation☆14Mar 29, 2025Updated 11 months ago
- Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation☆111Apr 16, 2025Updated 11 months ago
- Insurance AI Assistant A smart system combining PostgreSQL, Milvus, and specialized AI agents (Life/Home/Auto) to answer insurance querie…☆23Apr 29, 2025Updated 10 months ago
- Fractional Gabor Convolutional Network for Multi-source Remote Sensing Data Classification☆22Sep 17, 2021Updated 4 years ago
- Delta: LLM conversation branching☆12Dec 30, 2024Updated last year
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆59Feb 22, 2026Updated 3 weeks ago
- Demo code of "PSFormer: Pyramid Superpixel Transformer for Hyperspectral Image Classification"☆22Oct 25, 2024Updated last year
- [CVPR 2026] Official Implementation of Edit2Perceive☆34Feb 21, 2026Updated 3 weeks ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated last month
- Interactive AI Tutor that not just responds in text but engages with with students by "performing actions" on the interactive activity.☆16Oct 13, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- CS194-196 Course Project☆14Feb 20, 2025Updated last year
- ☆20Jun 16, 2020Updated 5 years ago
- A VSCode extension to display relationships between files in a codebase, overlaid on a circle packing diagram of the file structure.☆14Jan 8, 2023Updated 3 years ago
- ☆16Mar 22, 2025Updated 11 months ago
- This is the project page for the HOSNeRF☆16Dec 11, 2023Updated 2 years ago
- The official GitHub page for the survey paper "A Survey on LLM Symbolic Reasoning". And this paper is under review.☆26Feb 15, 2026Updated last month
- ☆21Jun 14, 2024Updated last year
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)☆34Sep 17, 2022Updated 3 years ago