abhishek0093 / autoFoley
Model : Give me a silent video...And I'm gonna to tell you what's happening in the video.. Will also add a new relevant background audio each time you use me. ..Thanks to deep Learning!!
☆21Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for autoFoley
- Site for sharing MusicGen + AudioGen Prompts and Creations☆39Updated 4 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆115Updated 8 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Updated 7 months ago
- ImageBind One Embedding Space to Bind Them All☆18Updated last year
- Video restoration Processing Pipeline☆27Updated 7 months ago
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆27Updated 10 months ago
- Audio datasets, easier.☆83Updated last year
- Versatile AI-driven audio upscaler to enhance the quality of any audio.☆60Updated 2 months ago
- Prepare spectrograms from audio for training a Riffusion model☆13Updated last year
- fine-tuning MusicGen without prompts to generate music with a specific style☆56Updated last year
- ☆27Updated last year
- [IEEE/CVF CVPR'2022] "ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation", Duolikun Danier, Fan Zhang, David Bull☆12Updated last year
- ☆22Updated 11 months ago
- Simple local all-in-one install for IDEA2.ART☆26Updated last year
- 🎵 LyricWave - Your AI Music Composer 🎶 Compose Unique MP4 Songs Effortlessly! LyricWave uses AI to create personalized music by harmoni…☆27Updated 7 months ago
- Examples of apps built with Nendo, the AI Audio Tool Suite☆56Updated 8 months ago
- ☆18Updated 2 months ago
- ☆96Updated last year
- dgenerate is a scriptable command line tool (and library) for generating images and animation sequences using stable diffusion and relate…☆26Updated last week
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- GUI for a Vocal Remover that uses Deep Neural Networks.☆14Updated 10 months ago
- Text prompt steered synthetic audio generators☆45Updated 11 months ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆50Updated 2 years ago
- Cog wrapper for collabora/WhisperSpeech☆25Updated 8 months ago
- text-to-audio-latent-diffusion☆35Updated last year
- ☆14Updated 3 months ago
- Diffusion Animation Toolkit☆33Updated last year
- One-shot face animation using webcam, capable of running in real time.☆31Updated 5 months ago
- Resonance: Audio-Image Interconversion for AI Diffusion Models☆19Updated 7 months ago