JackVinati / WaveWizard
A Gradio app for analyzing audio files to determine true sample rate and bit depth.
☆19 Updated last year
Alternatives and similar repositories for WaveWizard
Users who are interested in WaveWizard are comparing it to the repositories listed below.
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis. ☆45 Updated last year
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group ☆136 Updated last year
- ☆34 Updated 3 months ago
- ☆46 Updated 2 months ago
- This repository provides a minimal, single-file implementation of SingLoRA (Single Matrix Low-Rank Adaptation) as described in the paper … ☆44 Updated this week
- Text and image to video generation: Kandinsky 4.0 (2024) ☆149 Updated last year
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023) ☆17 Updated last year
- ☆77 Updated 9 months ago
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling" ☆43 Updated last year
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound ☆139 Updated 10 months ago
- Paper: "From Text to Pose to Image: Improving Diffusion Model Control and Quality" ☆57 Updated last year
- An official implementation of SwapAnyone. ☆74 Updated 10 months ago
- This is the official implementation of "T-LoRA: Single Image Diffusion Model Customization Without Overfitting" ☆127 Updated 2 weeks ago
- Kandinsky x Deforum — generating short animations ☆105 Updated 2 years ago
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning" ☆40 Updated last week
- ☆23 Updated last year
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis. ☆66 Updated last year
- ☆25 Updated 2 years ago
- Music production for silent film clips. ☆32 Updated 9 months ago
- ☆65 Updated last year
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025) ☆52 Updated 3 weeks ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture." ☆61 Updated last month
- Making Flux go brrr on GPUs. ☆161 Updated last month
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202… ☆29 Updated 3 weeks ago
- ☆13 Updated last year
- ☆48 Updated 11 months ago
- Enhancing Motion Dynamics of Image-to-Video Models via Adaptive Low-Pass Guidance (arXiv 2025) ☆54 Updated 6 months ago
- LIA-X: Interpretable Latent Portrait Animator ☆98 Updated 4 months ago
- Explore how Flux Dev responds when you change the strengths of layers in the model. ☆21 Updated last year
- ☆148 Updated last month