☆21Jan 22, 2026Updated 3 months ago
Alternatives and similar repositories for UITron-Speech
Users who are interested in UITron-Speech are comparing it to the libraries listed below.
- ☆67Sep 6, 2025Updated 8 months ago
- UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience☆68Apr 3, 2026Updated last month
- Tracking the latest and greatest research papers on diffusion large language models.☆33Mar 13, 2026Updated 2 months ago
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- Graph Convolutional Module for Temporal Action Localization in Videos☆10Jul 4, 2020Updated 5 years ago
- An album application.☆15Oct 28, 2025Updated 6 months ago
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆48Sep 15, 2025Updated 8 months ago
- ☆15Dec 11, 2023Updated 2 years ago
- (CVPR 26 Findings) Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-…☆35Apr 7, 2026Updated last month
- Official code for the ICLR 2025 paper, "Ada-K Routing: Boosting the Efficiency of MoE-based LLMs"☆12Mar 1, 2025Updated last year
- ☆10Apr 22, 2021Updated 5 years ago
- [ICLR 2020] Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma, "I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifie…☆20Dec 30, 2021Updated 4 years ago
- Scene Parsing via Integrated Classification Model and Variance-Based Regularization (Matlab&Caffe), In CVPR 2019☆11Jun 11, 2019Updated 6 years ago
- The official repo for "Unified Domain Adaptive Semantic Segmentation" (IEEE TPAMI 2025)☆34Aug 14, 2025Updated 9 months ago
- SimOn: A Simple Framework for Online Temporal Action Localization☆22Nov 12, 2022Updated 3 years ago
- Codes for our ICLR2020 paper: Knowledge Consistency between Neural Networks and Beyond☆16Jan 11, 2020Updated 6 years ago
- ☆14Dec 12, 2023Updated 2 years ago
- rkllm_talking is a standalone compiled voice communication system based on a large model…☆13Oct 13, 2024Updated last year
- [ICLR 2026] VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications☆131Feb 22, 2026Updated 2 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆48May 16, 2025Updated last year
- LLaVA_OpenVLA part 2, Generate MLLM general training data☆11Dec 27, 2024Updated last year
- Advanced Video Graph RAG using SAM2,CLIP,BLIP,Qwen2-VL,YOLO-World ,Neo4j, WebGPU, local LLM☆14Nov 25, 2024Updated last year
- Simple Fast API server that runs Dreambooth fine-tune jobs using Celery workers 🤙☆10Jun 18, 2024Updated last year
- Cobweb: An AI-native, logic-driven game engine.☆54Updated this week
- [AAAI 2022] DCAN: Improving Temporal Action Detection via Dual Context Aggregation☆17Nov 13, 2022Updated 3 years ago
- ☆34Sep 19, 2025Updated 8 months ago
- [ICLR 2022] Official Code Repository for "TRGP: TRUST REGION GRADIENT PROJECTION FOR CONTINUAL LEARNING"☆22Oct 5, 2022Updated 3 years ago
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆28Apr 14, 2026Updated last month
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 4 months ago
- Visualize Action Recognition Models☆11Apr 21, 2017Updated 9 years ago
- [NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents☆57Nov 27, 2025Updated 5 months ago
- Codes available of a paper: An Efficient Cervical Whole Slide Image Analysis Framework Based on Multi-scale Semantic and Location Deep Fe…☆16Jul 26, 2022Updated 3 years ago
- Property management system (frontend)☆39Dec 4, 2024Updated last year
- Database of "Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model", ECCV 2020☆13May 2, 2022Updated 4 years ago
- [ECCV 2022] Official Pytorch Implementation of the paper : " Semi-Supervised Temporal Action Detection with Proposal-Free Masking "☆21Jun 20, 2023Updated 2 years ago
- Implementation and checkpoints of Imagen, Google's text-to-image synthesis neural network, in Pytorch☆18Dec 22, 2022Updated 3 years ago
- Where is this IP?☆14Feb 24, 2024Updated 2 years ago
- Learning to Discriminate Information for Online Action Detection, CVPR 2020☆27Mar 24, 2023Updated 3 years ago
- A car re-identification app based on multi-feature fusion technique☆18Apr 24, 2022Updated 4 years ago