JackVinati / WaveWizardLinks
A Gradio app for analyzing audio files to determine true sample rate and bit depth.
☆17Updated 8 months ago
Alternatives and similar repositories for WaveWizard
Users that are interested in WaveWizard are comparing it to the libraries listed below
Sorting:
- The official GitHub Page for MiniMax☆38Updated last week
- ☆46Updated 6 months ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆17Updated 9 months ago
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆46Updated 8 months ago
- ☆22Updated 7 months ago
- A minimalistic, hackable code base to finetune Wan video generation model☆40Updated last month
- ☆70Updated last month
- Explore how Flux Dev responds when you change the strengths of layers in the model.☆20Updated 8 months ago
- Pytorch implementation of Towards Consistent and Controllable Image Synthesis for Face Editing☆55Updated last month
- ☆30Updated 8 months ago
- ☆14Updated 11 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 6 months ago
- Official implementation of MagicFace: Training-free Universal-Style Human Image Customized Synthesis.☆63Updated 5 months ago
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group☆133Updated 8 months ago
- This repository shows how to use Q8 kernels with `diffusers` to optimize inference of LTX-Video on ADA GPUs.☆17Updated 5 months ago
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices☆115Updated 3 months ago
- faster parallel inference of mochi-1 video generation model☆121Updated 3 months ago
- An official implementation of SwapAnyone.☆62Updated 2 months ago
- A new one shot head swapping approach☆81Updated 3 months ago
- ☆79Updated 3 months ago
- Kandinsky x Deforum — generating short animations☆104Updated last year
- Fine-tune of Florence-2 for shot categorization.☆24Updated 3 months ago
- ☆27Updated 7 months ago
- ☆24Updated last year
- Official code for SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound☆124Updated 2 months ago
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (arXiv, 2024)☆51Updated 6 months ago
- Text and image to video generation: Kandinsky 4.0 (2024)☆145Updated 5 months ago
- Animatediff implementation. Includes a ControlNet pipeline.☆18Updated last year
- ☆71Updated 7 months ago
- Paper: "From Text to Pose to Image: Improving Diffusion Model Control and Quality"☆51Updated 6 months ago