JackVinati / WaveWizard
A Gradio app for analyzing audio files to determine true sample rate and bit depth.
☆18 Updated last year
Alternatives and similar repositories for WaveWizard
Users who are interested in WaveWizard are comparing it to the repositories listed below.
- ☆78 Updated 6 months ago
- PyTorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group ☆136 Updated last year
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis. ☆45 Updated last year
- ☆45 Updated 11 months ago
- Text and image to video generation: Kandinsky 4.0 (2024) ☆148 Updated 10 months ago
- An official implementation of SwapAnyone. ☆69 Updated 7 months ago
- The official GitHub Page for MiniMax ☆58 Updated last week
- This is the official implementation of "T-LoRA: Single Image Diffusion Model Customization Without Overfitting" ☆123 Updated 4 months ago
- Kandinsky x Deforum — generating short animations ☆104 Updated last year
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling" ☆43 Updated last year
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound ☆133 Updated 7 months ago
- Kandinsky 5.0: A family of diffusion models for Video & Image generation ☆176 Updated last week
- ☆69 Updated last year
- ☆20 Updated last year
- Collection of scripts to build small-scale datasets for fine-tuning video generation models. ☆70 Updated 7 months ago
- Making Flux go brrr on GPUs. ☆151 Updated 3 months ago
- LIA-X: Interpretable Latent Portrait Animator ☆89 Updated last month
- Official Implementation for "Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing" ☆54 Updated last year
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis. ☆65 Updated 10 months ago
- ☆60 Updated last year
- This repository provides a minimal, single-file implementation of SingLoRA (Single Matrix Low-Rank Adaptation) as described in the paper … ☆44 Updated 3 weeks ago
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices ☆124 Updated 3 months ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ☆17 Updated last year
- ☆23 Updated last year
- Official Implementation for "The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Ed… ☆107 Updated last year
- ☆13 Updated last year
- Faster parallel inference of mochi-1 video generation model ☆125 Updated 8 months ago
- [AAAI 2025] The official repository of UniMuMo ☆121 Updated last month
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (Arxiv 2025) ☆34 Updated 4 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc. ☆38 Updated last year