abhishek0093 / autoFoley
Model: Give me a silent video and I'll tell you what's happening in it. I'll also add new, relevant background audio each time you use me. Thanks to deep learning!
☆20 Updated 2 years ago
Alternatives and similar repositories for autoFoley:
Users that are interested in autoFoley are comparing it to the libraries listed below
- text-to-audio-latent-diffusion ☆37 Updated last year
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer ☆28 Updated last year
- ☆27 Updated last year
- Audio datasets, easier. ☆84 Updated last year
- A flexible gateway for running ML inference jobs through cloud providers or your own GPU. Powered by Replicate and Cloudflare Workers. ☆28 Updated 2 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆57 Updated last year
- The code for some apps built with Sieve. ☆77 Updated 5 months ago
- ☆13 Updated 2 years ago
- Simple local all-in-one install for IDEA2.ART ☆26 Updated 2 years ago
- 🎵 LyricWave - Your AI Music Composer 🎶 Compose Unique MP4 Songs Effortlessly! LyricWave uses AI to create personalized music by harmoni… ☆30 Updated last month
- Resources on AI applications in the music domain ☆17 Updated last month
- Site for sharing MusicGen + AudioGen Prompts and Creations ☆41 Updated last month
- A universal version of TheLastBen's fast-stable-diffusion - no longer maintained ☆11 Updated 7 months ago
- A collection of pre-trained audio models, in PyTorch. ☆113 Updated 2 years ago
- A neural network based file sorter. Trains an autoencoder to sort images or audio based on the similarity of their encodings, or uses the… ☆30 Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆120 Updated this week
- ☆22 Updated last year
- Easily turn large sets of audio urls to an audio dataset. ☆21 Updated 2 years ago
- tools to manipulate audio with riffusion ☆93 Updated last year
- Implementations of zero-shot capabilities with Open AI's CLIP and computer vision models ☆34 Updated 7 months ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for … ☆14 Updated 6 months ago
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel… ☆84 Updated 4 months ago
- Decked-out gradio client for audio diffusion, mainly stable-audio-tools. ☆37 Updated 10 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io ☆16 Updated last year
- Gradio UI for YuE ☆39 Updated 2 weeks ago
- Fork of AudioLDM as a TuneFlow plugin ☆40 Updated 2 years ago
- Replicate COG Prompt Parrot ☆25 Updated 2 years ago
- Prepare spectrograms from audio for training a Riffusion model ☆15 Updated 2 years ago
- BIG: Back In the Game of Creative AI ☆27 Updated 2 years ago
- Convert an audio file to a waveform video ☆10 Updated last year