univa-agent/univa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/univa-agent/univa)

univa-agent / univa

Official Code Repo for UniVA: Universal Video Agents

☆515

Alternatives and similar repositories for univa

Users that are interested in univa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BestiVictory / HistoryNet
View on GitHub
Focusing on Persons: Colorizing Old Images Learning from Modern Historical Movies
☆16Jul 6, 2022Updated 4 years ago
Kartik-3004 / facexbench
View on GitHub
[IEEE T-BIOM] FaceXBench: Evaluating Multimodal LLMs on Face Understanding
☆20Jan 15, 2026Updated 5 months ago
Karl1109 / LIDAR-Mamba
View on GitHub
[ACM MM 2025] LIDAR: Lightweight Adaptive Cue-Aware Fusion Vision Mamba for Multimodal Segmentation of Structural Cracks
☆23Nov 18, 2025Updated 7 months ago
emanueleielo / ciana-parrot
View on GitHub
Self-hosted AI assistant with multi-channel support, scheduled tasks, and extensible skills
☆76Apr 20, 2026Updated 2 months ago
Shenyi-Z / Cache4Diffusion
View on GitHub
Aiming to integrate most existing feature caching-based diffusion acceleration schemes into a unified framework.
☆107Oct 23, 2025Updated 8 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
mlpc-ucsd / PixARMesh
View on GitHub
(CVPR 2026) PixARMesh: Autoregressive Mesh-Native Single-View Scene Reconstruction
☆67May 31, 2026Updated last month
TencentARC / TimeLens
View on GitHub
[CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs
☆153Apr 27, 2026Updated 2 months ago
nenhang / ContextGen
View on GitHub
[ICLR 2026] ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation
☆83Apr 19, 2026Updated 2 months ago
cablate / Claude-Code-Board
View on GitHub
Use Claude Code on Kanban WebUI
☆151Jan 28, 2026Updated 5 months ago
jiahao6635 / HeyGemWeb
View on GitHub
☆34Jul 23, 2025Updated 11 months ago
HKUDS / ViMax
View on GitHub
"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"
☆10,985Jun 30, 2026Updated last week
Kevin-thu / StoryMem
View on GitHub
Official code for StoryMem: Multi-shot Long Video Storytelling with Memory
☆759May 25, 2026Updated last month
eyalzh / claude-code-toast
View on GitHub
A Claude Code Notification hook for MacOS that displays a toast message when Claude Code is waiting for the user to respond.
☆17Jul 22, 2025Updated 11 months ago
Akshit21112002 / TTRV
View on GitHub
TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)
☆45Mar 8, 2026Updated 4 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
remotion-dev / recorder
View on GitHub
Video production for developers
☆45Jun 22, 2026Updated 2 weeks ago
zhoucz97 / ECPE-MM-R
View on GitHub
[COLING2022] A Multi-turn Machine Reading Comprehension Framework with Rethink Mechanism for Emotion-Cause Pair Extraction
☆18Oct 13, 2022Updated 3 years ago
PKU-YuanGroup / Look-Back
View on GitHub
This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".
☆97Jul 10, 2025Updated last year
MGitHubL / TMac
View on GitHub
☆14Feb 26, 2024Updated 2 years ago
kandinskylab / kandinsky-5
View on GitHub
Kandinsky 5.0: A family of diffusion models for Video & Image generation
☆784Updated this week
THU-KEG / LongWriter-V
View on GitHub
[ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models
☆24Mar 29, 2025Updated last year
moonmath-ai / LiteAttention
View on GitHub
Transforming Video Diffusion with Temporal Sparse Attention
☆54Apr 8, 2026Updated 3 months ago
salmon-donate / salmon-donate
View on GitHub
An open-source, self-hosted, and non-custodial solution for receiving cryptocurrency donations.
☆36Jul 1, 2025Updated last year
T0UGH / videoclaw
View on GitHub
AI 视频创作 CLI 工具，深度集成 Seedance 2.0 + Chatgpt Imagen2 业界顶级模型，告诉 AI 你的想法，它会自动完成从素材生成到视频合成的全部工作，并支持自动发布到抖音、快手等平台。
☆109May 18, 2026Updated last month
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
opensearch-project / dashboards-flow-framework
View on GitHub
A UI designer for constructing AI applications with OpenSearch
☆16Updated this week
maxgfr / csv-ai-analyzer
View on GitHub
A self-hosted, browser-based AI CSV analyzer
☆81Updated this week
DGAzr / ourschool
View on GitHub
Free and open source application for tracking and managing a home schooling program.
☆58Updated this week
ziplab / CoV
View on GitHub
[ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning
☆63Apr 7, 2026Updated 3 months ago
Heven-Pan / UFVideo
View on GitHub
[CVPR 2026] UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models
☆37Feb 21, 2026Updated 4 months ago
SLIT-AI / WRPO
View on GitHub
[ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion
☆14Mar 17, 2025Updated last year
longtaojiang / SmartEraser
View on GitHub
[CVPR 2025] Official implementation of the paper "SmartEraser: Remove Anything from Images using Masked-Region Guidance".
☆205Jul 1, 2025Updated last year
Biangbiang0321 / SpotEdit
View on GitHub
SpotEdit:Selective Region Editing in Diffusion Transformers
☆194Jul 1, 2026Updated last week
Kr1sJFU / iMontage
View on GitHub
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
☆188Dec 1, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
drehelis / gcp-emulator-ui
View on GitHub
A modern web interface for managing Google Cloud Platform emulator services 🎮
☆45Jun 29, 2026Updated last week
aschmelyun / tsplice
View on GitHub
Splice and merge videos from the terminal
☆25Oct 4, 2025Updated 9 months ago
KAIST-Visual-AI-Group / Flow-Inference-Time-Scaling
View on GitHub
[NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
☆75Oct 12, 2025Updated 8 months ago
ARadRareness / mcp-registry
View on GitHub
A central registry and HTTP interface for coordinating Model Context Protocol (MCP) servers.
☆35Dec 29, 2024Updated last year
QuanjianSong / FashionChameleon
View on GitHub
Official Pytorch Code of the Paper "FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization"
☆250May 31, 2026Updated last month
sejmoonwei / SPGrasp
View on GitHub
Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.
☆20Jun 2, 2026Updated last month
zsgvivo / VideoZoomer
View on GitHub
☆34Feb 12, 2026Updated 4 months ago