CUC-MIPG / UniVid
Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" (WACV 2026)
☆36 · Nov 24, 2025 · Updated 2 months ago
Alternatives and similar repositories for UniVid
Users who are interested in UniVid are comparing it to the repositories listed below.
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model ☆13 · Dec 29, 2024 · Updated last year
- ☆36 · Feb 5, 2026 · Updated last week
- TPDiff: Temporal Pyramid Video Diffusion Model ☆23 · Mar 13, 2025 · Updated 11 months ago
- Generate image at any resolution. ☆43 · Sep 16, 2025 · Updated 4 months ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w… ☆13 · Jun 28, 2025 · Updated 7 months ago
- ☆72 · Nov 24, 2025 · Updated 2 months ago
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations" ☆88 · Jun 6, 2025 · Updated 8 months ago
- A visual novel made with Godot Engine. ☆11 · Sep 18, 2023 · Updated 2 years ago
- A collection of projects built with detectron2 ☆33 · May 6, 2022 · Updated 3 years ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models ☆42 · Mar 11, 2025 · Updated 11 months ago
- Official implementation of "Imaginarium: Vision-guided High-quality 3D Scene Layout Generation" ☆41 · Dec 30, 2025 · Updated last month
- Modern normalizing flows in Python. Simple to use and easily extensible. ☆11 · Updated this week
- Azure Machine Learning - MLOps Python SDKv2 ☆10 · Jul 24, 2023 · Updated 2 years ago
- ☆16 · Sep 18, 2025 · Updated 4 months ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models ☆26 · Feb 4, 2026 · Updated last week
- SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time ☆96 · Jan 1, 2026 · Updated last month
- Remove NotebookLM watermarks from slides. Local processing, no upload needed. ☆31 · Jan 15, 2026 · Updated 3 weeks ago
- 🌟 A step-by-step guide to inserting code links in your paper ☆24 · Aug 2, 2025 · Updated 6 months ago
- [ICLR2025] Are Large Vision Language Models Good Game Players? ☆13 · Mar 3, 2025 · Updated 11 months ago
- Multi-AI documentation for OpenClaw: architecture, security audits, deployment guide ☆67 · Updated this week
- Record your screen, trim the clips and export single video file with result. No nonsense screen capture and recording to make quick video… ☆24 · Dec 4, 2025 · Updated 2 months ago
- PhysWorld: From Real Videos to World Models of Deformable Objects via Physics-Aware Demonstration Synthesis ☆34 · Oct 27, 2025 · Updated 3 months ago
- ☆16 · Sep 1, 2025 · Updated 5 months ago
- Python solutions to coding questions in Leetcode ☆13 · Sep 12, 2020 · Updated 5 years ago
- Various test models in WNNX format. They can be viewed with `pip install wnetron && wnetron` ☆12 · Jun 22, 2022 · Updated 3 years ago
- Turn your prompt salad into sushi! A dev tool to analyze and improve everything your app sends to LLMs ☆21 · Sep 20, 2025 · Updated 4 months ago
- Image Tokenizer Needs Post-Training ☆24 · Oct 4, 2025 · Updated 4 months ago
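- Markdown editor, Android Markdown editor, Android Markdown software, mobile Markdown tool, Ushio MD, Markdown live preview, syntax-highlighting editor, immersive writing tool, handy mobile writing tool, personalized Markdown themes, automatic… ☆28 · Feb 1, 2026 · Updated last week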
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models ☆17 · Nov 4, 2025 · Updated 3 months ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json. ☆22 · Jan 4, 2026 · Updated last month
- Creating Your Divine Agent 😇 ☆10 · Jan 26, 2026 · Updated 2 weeks ago
- [ACCV 2024 Poster] official code for "VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model" ☆10 · Sep 28, 2024 · Updated last year
- Color detection, Contour mapping, Detecting holes, Motion detection ☆10 · Mar 20, 2014 · Updated 11 years ago
- G-Buffer-Conditioned Diffusion for Neural Forward Frame Rendering. ☆23 · Jan 31, 2026 · Updated last week
- ☆10 · Oct 7, 2019 · Updated 6 years ago
- A real-time radio station driven by multiple agents ☆30 · Updated this week
- A scalable data preprocessing framework built on PySpark for LLM training ☆21 · Dec 9, 2025 · Updated 2 months ago
- I am curating the best Black Friday and Cyber Monday deals for developers, mostly learning resources to prepare for coding and system design i… ☆30 · Nov 26, 2025 · Updated 2 months ago
- An MCP server that runs AI-driven venture capitalist agents (Fred Wilson, Peter Thiel, etc.), whose thinking is continuously enriched by … ☆18 · May 12, 2025 · Updated 9 months ago