Official Code Repo for UniVA: Universal Video Agents
☆491May 9, 2026Updated 2 weeks ago
Alternatives and similar repositories for univa
Users who are interested in univa are comparing it to the libraries listed below.
- Focusing on Persons: Colorizing Old Images Learning from Modern Historical Movies☆16Jul 6, 2022Updated 3 years ago
- [IEEE T-BIOM] FaceXBench: Evaluating Multimodal LLMs on Face Understanding☆20Jan 15, 2026Updated 4 months ago
- Python script that monitors Douyin users' posts and follower changes, then pushes notifications☆13Apr 13, 2023Updated 3 years ago
- [ACM MM 2025] LIDAR: Lightweight Adaptive Cue-Aware Fusion Vision Mamba for Multimodal Segmentation of Structural Cracks☆23Nov 18, 2025Updated 6 months ago
- v0.6: main framework; API + local model calls; front-end interface; long-form outlines; structured information construction; HITL☆36May 6, 2026Updated 3 weeks ago
- Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆48Jul 17, 2025Updated 10 months ago
- [CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs☆134Apr 27, 2026Updated last month
- Camera app drawn on SkiaSharp canvas with real-time SKSL shaders. Built-in desktop shader editor. Made with DrawnUI for .NET MAUI.☆26Apr 21, 2026Updated last month
- ☆77Dec 8, 2025Updated 5 months ago
- [ICLR 2026] ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation☆77Apr 19, 2026Updated last month
- ∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆21Feb 14, 2025Updated last year
- MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.☆542Apr 30, 2026Updated last month
- ☆32Jul 23, 2025Updated 10 months ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆64Mar 19, 2026Updated 2 months ago
- Official code for StoryMem: Multi-shot Long Video Storytelling with Memory☆729Jan 22, 2026Updated 4 months ago
- "ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"☆7,472Mar 29, 2026Updated 2 months ago
- Video production for developers☆41May 1, 2026Updated 3 weeks ago
- Official implementation of "InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention" (NeurIPS 2025)☆43May 5, 2026Updated 3 weeks ago
- This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".☆92Jul 10, 2025Updated 10 months ago
- Kandinsky 5.0: A family of diffusion models for Video & Image generation☆767May 22, 2026Updated last week
- Evaluation Tool for Anomaly Detection Research☆17May 9, 2024Updated 2 years ago
- Transforming Video Diffusion with Temporal Sparse Attention☆47Apr 8, 2026Updated last month
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆22Mar 29, 2025Updated last year
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆25Oct 17, 2024Updated last year
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning☆87Nov 4, 2025Updated 6 months ago
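- AI video creation CLI tool with deep integration of the industry-leading Seedance 2.0 and Nano Banana Pro models. Tell the AI your idea and it automatically handles everything from asset generation to video compositing, with support for automatic publishing to platforms such as Douyin and Kuaishou.☆93May 18, 2026Updated last week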
- Official code for "RAG Meets Temporal Graphs: Time-Sensitive Modeling and Retrieval for Evolving Knowledge".☆32Feb 25, 2026Updated 3 months ago
- Simple application for tracking and managing a home schooling program.☆40Sep 13, 2025Updated 8 months ago
- A UI designer for constructing AI applications with OpenSearch☆16Updated this week
- 🍑 relsim: Relational Visual Similarity | pip install relsim 🌍 (CVPR 2026)☆77Apr 8, 2026Updated last month
- ☆10Apr 16, 2023Updated 3 years ago
- [CVPR 2026] UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models☆37Feb 21, 2026Updated 3 months ago
- [CVPR 2025] Official implementation of the paper "SmartEraser: Remove Anything from Images using Masked-Region Guidance".☆201Jul 1, 2025Updated 10 months ago
- iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation☆186Dec 1, 2025Updated 5 months ago
- Splice and merge videos from the terminal☆25Oct 4, 2025Updated 7 months ago
- A modern web interface for managing Google Cloud Platform emulator services 🎮☆41May 18, 2026Updated last week
- [NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆74Oct 12, 2025Updated 7 months ago
- A central registry and HTTP interface for coordinating Model Context Protocol (MCP) servers.☆35Dec 29, 2024Updated last year
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆20Jan 6, 2026Updated 4 months ago