Official Code Repo for UniVA: Universal Video Agents
☆421Jan 27, 2026Updated last month
Alternatives and similar repositories for univa
Users that are interested in univa are comparing it to the libraries listed below.
- Focusing on Persons: Colorizing Old Images Learning from Modern Historical Movies☆15Jul 6, 2022Updated 3 years ago
- [CVPR 2026] Official Implementation of Edit2Perceive☆34Feb 21, 2026Updated last month
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- "ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"☆2,501Feb 28, 2026Updated 3 weeks ago
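- AI video creation CLI tool with deep integration of the industry-leading Seedance 2.0 and Nano Banana Pro models. Tell the AI your idea and it automatically handles everything from asset generation to video composition, and supports automatic publishing to platforms such as Douyin and Kuaishou.☆67Mar 7, 2026Updated 2 weeks ago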
- Implementation of SmoothCache, a project aimed at speeding up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆48Jul 17, 2025Updated 8 months ago
- Aiming to integrate most existing feature caching-based diffusion acceleration schemes into a unified framework.☆97Oct 23, 2025Updated 5 months ago
- Camera app drawn on SkiaSharp canvas with real-time SKSL shaders. Built-in desktop shader editor. Made with DrawnUI for .NET MAUI.☆22Mar 8, 2026Updated 2 weeks ago
- [ICLR 2026] ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation☆70Feb 12, 2026Updated last month
- ☆63Nov 23, 2025Updated 4 months ago
- ☆222Updated this week
- ∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation☆19Feb 14, 2025Updated last year
- ☆30Jul 23, 2025Updated 8 months ago
- Use Claude Code on Kanban WebUI☆147Jan 28, 2026Updated last month
- Automatically extract executable programs from pruned mechanistic circuits, extending OpenAI's Sparse Circuits☆66Nov 23, 2025Updated 4 months ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆59Dec 26, 2025Updated 2 months ago
- Two separate tutorial series for the OpenAI Swarm library, each with 10 files ranging from basic to advanced☆14Jan 14, 2025Updated last year
- ICML 2025 Spotlight, PCEvolve: Private Contrastive Evolution for Synthetic Dataset Generation via Few-Shot Private Data and Generative AP…☆14Jun 27, 2025Updated 8 months ago
- In this project I used Apache Airflow to scrape a website periodically. This is for the tutorials I do on YouTube.☆10Nov 21, 2022Updated 3 years ago
- Code for the paper: "Modular Neural Image Signal Processing". A modular neural ISP with interpretable stages, multi-style rendering, cros…☆34Jan 19, 2026Updated 2 months ago
- Kandinsky 5.0: A family of diffusion models for Video & Image generation☆732Mar 6, 2026Updated 2 weeks ago
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆37Mar 8, 2026Updated 2 weeks ago
- Official implementation of "InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention" (NeurIPS 2025)☆41Oct 17, 2025Updated 5 months ago
- A framework for hosting and scaling AI agents.☆40Nov 25, 2024Updated last year
- CoV: Chain-of-View Prompting for Spatial Reasoning☆52Jan 23, 2026Updated 2 months ago
- Get started setting up infrastructure as code on Google Cloud Platform☆11Jun 13, 2021Updated 4 years ago
- [COLING2022] A Multi-turn Machine Reading Comprehension Framework with Rethink Mechanism for Emotion-Cause Pair Extraction☆18Oct 13, 2022Updated 3 years ago
- ☆87Feb 14, 2026Updated last month
- An open-source, self-hosted, and non-custodial solution for receiving cryptocurrency donations.☆37Jul 1, 2025Updated 8 months ago
- Local-first AI knowledge layer. Extract architecture, query from any AI tool via MCP. Private by architecture.☆76Updated this week
- Transforming Video Diffusion with Temporal Sparse Attention☆47Updated this week
- Simple application for tracking and managing a home schooling program.☆40Sep 13, 2025Updated 6 months ago
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆26Oct 17, 2024Updated last year
- SpotEdit: Selective Region Editing in Diffusion Transformers☆176Jan 5, 2026Updated 2 months ago
- A UI designer for constructing AI applications with OpenSearch☆16Mar 13, 2026Updated last week
- [ACM MM25] LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆23Mar 29, 2025Updated 11 months ago
- ☆98May 22, 2024Updated last year
- Official code for StoryMem: Multi-shot Long Video Storytelling with Memory☆704Jan 22, 2026Updated 2 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Jul 15, 2024Updated last year