abhishek0093 / autoFoleyLinks
Model : Give me a silent video...And I'm gonna to tell you what's happening in the video.. Will also add a new relevant background audio each time you use me. ..Thanks to deep Learning!!
☆20Updated 2 years ago
Alternatives and similar repositories for autoFoley
Users that are interested in autoFoley are comparing it to the libraries listed below
Sorting:
- Prepare spectrograms from audio for training a Riffusion model☆15Updated 2 years ago
- Audio datasets, easier.☆84Updated last year
- Decked-out gradio client for audio diffusion, mainly stable-audio-tools.☆37Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆122Updated last week
- ☆27Updated last year
- text-to-audio-latent-diffusion☆37Updated last year
- ☆11Updated last year
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆30Updated last year
- Chord conditioning implemented MusicGen☆57Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel…☆87Updated 6 months ago
- StoryDiffusion serverless worker☆17Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated 10 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated last year
- ☆19Updated 9 months ago
- 🐸Coqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video ga…☆41Updated 2 years ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆45Updated 3 months ago
- ☆75Updated last year
- tools to manipulate audio with riffusion☆95Updated last year
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 3 weeks ago
- Retrieval-based Voice Conversion (RVC) implemented with Hugging Face Transformers.☆68Updated 3 weeks ago
- Diffusion Animation Toolkit☆37Updated last year
- XTTS: Multilingual Voice Cloning TTS Model by Coqui Deployed to Replicate☆24Updated last year
- Safely push a Cog model version by making sure it works and is backwards-compatible with previous versions.☆16Updated 2 weeks ago
- Cog implementation of the ByteDance/Hyper-SD Flux.1-Dev 8-step LoRA☆14Updated 3 months ago
- A collection of pre-trained audio models, in PyTorch.☆113Updated 2 years ago
- ☆96Updated last year
- Resources on AI applications in the music domain☆18Updated 3 months ago
- ☆62Updated 11 months ago