Gabeiscool420 / SoundSage---LLM-Audio-Processing
Open source Python program for automating gain staging. part 1 of a series for automating audio processing tasks, end goal is to create a full set of tools for an AI to use for automating Audio processing for Music, Film, Game and any other possible applications. UI for AutoGain is very basic but the app is very functional. currently only for Ma…
☆33Updated last year
Related projects ⓘ
Alternatives and complementary repositories for SoundSage---LLM-Audio-Processing
- Fork of AudioLDM as a TuneFlow plugin☆38Updated last year
- Chord conditioning implemented MusicGen☆43Updated 7 months ago
- Text prompt steered synthetic audio generators☆44Updated 11 months ago
- ☆253Updated 5 months ago
- Object-oriented handling of audio data, with GPU-powered augmentations, and more.☆233Updated 2 weeks ago
- Audio generation using diffusion models, in PyTorch.☆46Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆83Updated 6 months ago
- A curated list of awesome OpenAI's Whisper☆93Updated last year
- Versatile AI-driven audio upscaler to enhance the quality of any audio.☆59Updated last month
- 💬 "Realtime" voice transcription and cloning using ElevenLabs's API.☆49Updated last year
- On-device Speech-to-Index engine powered by deep learning☆34Updated last month
- The Nendo AI Audio Tool Suite☆210Updated 6 months ago
- Sing an idea ➡️ AI music sample🔥🎶☆90Updated 6 months ago
- ☆107Updated last year
- ☆13Updated last year
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.☆146Updated 2 months ago
- ☆139Updated 3 weeks ago
- Create training data for training a voice cloner for bark text to speech.☆44Updated last year
- a notebook containing scripts, documentation, and examples for finetuning musicgen☆75Updated 7 months ago
- A collection of pre-trained audio models, in PyTorch.☆110Updated last year
- Nendo plugin for MusicGen: A state-of-the-art controllable text-to-music model (by Meta Research)☆15Updated 7 months ago
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel…☆53Updated this week
- text-to-audio-latent-diffusion☆34Updated last year
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆81Updated last month
- Cog wrapper for collabora/WhisperSpeech☆24Updated 8 months ago
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.☆139Updated 10 months ago
- Demos of Essentia models hosted on Replicate.com☆40Updated 4 months ago
- Auto-Video maker handling many AI's☆12Updated 7 months ago
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.☆11Updated 11 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆53Updated 6 months ago