official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".
☆51Dec 25, 2025Updated 2 months ago
Alternatives and similar repositories for SSVAE
Users that are interested in SSVAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆55Dec 25, 2025Updated 2 months ago
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos"☆20Dec 14, 2025Updated 3 months ago
- ☆23Jul 20, 2025Updated 8 months ago
- [AAAI 2025] AeroGTO: An Efficient Graph-Transformer Operator for Learning Large-Scale Aerodynamics of 3D Vehicle Geometries☆22Jul 20, 2025Updated 8 months ago
- Official implementation of "Where and How to Perturb: On the Design of Perturbation Guidance in Diffusion and Flow Models"☆35Nov 30, 2025Updated 3 months ago
- ☆16Dec 6, 2014Updated 11 years ago
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)☆45Nov 24, 2025Updated 3 months ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models (ICLR 2026)☆47Mar 3, 2026Updated 2 weeks ago
- PyTorch code for our paper "Progressive Binarization with Semi-Structured Pruning for LLMs"☆13Mar 11, 2026Updated last week
- ☆59Nov 12, 2025Updated 4 months ago
- ☆46Mar 12, 2026Updated last week
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 7 months ago
- ☆22Nov 18, 2025Updated 4 months ago
- [[NeurIPS 2025] UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions☆85Jul 14, 2025Updated 8 months ago
- This is the official repository for "BokehDiff: Neural Lens Blur with One-Step Diffusion" (ICCV'25).☆46Sep 12, 2025Updated 6 months ago
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 6 months ago
- Raw-to-End Name Entity Recognition in Social Media☆16Oct 16, 2019Updated 6 years ago
- [ AAAI26 ]: “VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame Interpolation”☆17Mar 9, 2026Updated 2 weeks ago
- Reading Group @ DMG☆11Nov 15, 2018Updated 7 years ago
- ☆110Sep 3, 2025Updated 6 months ago
- Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Transformer Train…☆28Updated this week
- [AAAI2026] Bring Your Dreams to Life: Continual Text-to-Video Customization☆36Dec 9, 2025Updated 3 months ago
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…☆52Jul 24, 2025Updated 8 months ago
- RuDaS: Synthetic Datasets for Rule Learning☆19Jun 21, 2022Updated 3 years ago
- Visual Generation Tuning☆99Jan 27, 2026Updated last month
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆36Oct 15, 2025Updated 5 months ago
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 7 months ago
- EARL: Editing with Autoregression and RL☆42Nov 21, 2025Updated 4 months ago
- Official codes of the 1st place for The NVIDIA AI City Challenge 2023 - Track 2☆19Jul 25, 2023Updated 2 years ago
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"☆62Jul 1, 2025Updated 8 months ago
- ☆14Oct 3, 2025Updated 5 months ago
- ☆13May 2, 2025Updated 10 months ago
- [ICCV 2025] CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers☆16Mar 3, 2026Updated 2 weeks ago
- [CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models☆52Feb 21, 2026Updated last month
- 语音合成VITS 纯中文微调☆12Mar 15, 2023Updated 3 years ago
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆55Oct 29, 2024Updated last year
- ☆20Jan 1, 2026Updated 2 months ago
- Library about construction helper for Generative models e.g. Flow-based Model with Tensorflow 2.x.☆12Feb 16, 2023Updated 3 years ago
- [TMM 2025] Official Implementation of DreamJourney: Perpetual View Generation with Video Diffusion Models☆18Jun 24, 2025Updated 9 months ago