knoriy / CLARA
☆61Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for CLARA
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆36Updated last month
- The demo page of UniAudio☆34Updated 9 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆42Updated this week
- VoiceBox neural network implementation☆96Updated 3 months ago
- VALL-E 2 reproduction☆83Updated 3 months ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated 5 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆62Updated last week
- ☆251Updated 7 months ago
- lina-speech : linear attention based text-to-speech☆134Updated this week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆83Updated last month
- ☆59Updated last year
- Codebase and project page for EDMSound☆29Updated 11 months ago
- Video+code lecture on building nanoGPT from scratch☆64Updated 4 months ago
- ☆76Updated 2 months ago
- ☆34Updated 6 months ago
- Audio tokenization, in the fastest way possible!☆45Updated 2 months ago
- Collection of scripts from mHuBERT-147.☆22Updated 4 months ago
- Supervoice diffusion enhance☆25Updated 3 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆81Updated last month
- Joint speech-language model - respond directly to audio!☆30Updated 5 months ago
- The official Implementation of PeriodWave and PeriodWave-Turbo☆128Updated 2 months ago
- Offical code for the CVPR 2024 Paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language☆61Updated 4 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆129Updated last year
- Collection of Open Source Speech Data☆143Updated this week
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆71Updated 2 months ago
- Pytorch implementation of SoundCTM☆70Updated last month
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆76Updated 10 months ago
- Unsupervised Rhythm Modeling for Voice Conversion☆79Updated last year
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆28Updated 3 weeks ago
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆124Updated 5 months ago