abhishek0093 / autoFoleyLinks
Model : Give me a silent video...And I'm gonna to tell you what's happening in the video.. Will also add a new relevant background audio each time you use me. ..Thanks to deep Learning!!
☆20Updated 3 years ago
Alternatives and similar repositories for autoFoley
Users that are interested in autoFoley are comparing it to the libraries listed below
Sorting:
- ☆75Updated 2 years ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆49Updated 10 months ago
- ☆187Updated 2 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆123Updated 7 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆60Updated last year
- ☆96Updated 2 years ago
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆34Updated 2 years ago
- Sing an idea ➡️ AI music sample🔥🎶☆119Updated last year
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆32Updated 2 years ago
- fine-tuning MusicGen without prompts to generate music with a specific style☆66Updated 2 years ago
- Cog wrapper for FalconsAi / nsfw_image_detection☆18Updated 6 months ago
- Text-to-Music Generation with Rectified Flow Transformer☆64Updated 8 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 3 years ago
- ☆62Updated last year
- Gradio UI for YuE☆89Updated 10 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆61Updated last year
- SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution☆14Updated 2 years ago
- YuE with mp3 extend, exllama and GUI☆64Updated 11 months ago
- Awesome music generation model——MG²☆165Updated 10 months ago
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel…☆88Updated last year
- ☆23Updated 2 years ago
- Attempt at cog wrapper using ComfyUI to run a SDXL txt2img workflow config☆23Updated 2 years ago
- Audio datasets, easier.☆86Updated 2 years ago
- 🎼 text-to-video system for music visualization☆56Updated last year
- Create training data for training a voice cloner for bark text to speech.☆48Updated 2 years ago
- ☆107Updated 2 years ago
- ☆27Updated 2 years ago
- text-to-audio-latent-diffusion☆37Updated 2 years ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆13Updated last year