ex3ndr / supervoice-separate

Supervoice Speaker Separation Network

☆12

Alternatives and similar repositories for supervoice-separate

Users that are interested in supervoice-separate are comparing it to the libraries listed below

uthree / ddsp-vocoder
☆10 Updated 8 months ago
tencent-ailab / TriNet
TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.
☆26 Updated 2 years ago
ex3ndr / supervoice-gpt-facodec
GPT for FACodec
☆13 Updated last year
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26 Updated 11 months ago
patrickvonplaten / audio-gen-dreambooth
☆23 Updated 2 years ago
Infinity-INF / fast-phasr
Phoneme and duration labeling based on Whisper small
☆11 Updated last year
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17 Updated last year
thuhcsi / PortableTTS
☆12 Updated 2 years ago
SLPcourse / Singing-Voice-Conversion
Project of Singing Voice Conversion.
☆15 Updated last year
TariqAHassan / HiFiHybrid
Hifi-like Vocoder implemented in PyTorch
☆13 Updated 2 years ago
ex3ndr / supervoice-vocoder
Production-ready vocoder using BigVSAN
☆11 Updated last year
SonyResearch / diffvox
Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"
☆28 Updated this week
Plachtaa / ASTRAL-quantization
speaker-disentangled speech linguistic content quantizer
☆21 Updated 3 months ago
y-chan / hifi-gan-misrnet
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15 Updated 2 years ago
AI-S2-Lab / EmoPP
[NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech
☆22 Updated 10 months ago
Respaired / RiFornet_Vocoder
a Neural Vocoder supporting Ring Attention, Conformer and NSF.
☆19 Updated 4 months ago
sushant-t / tts-trainer
Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…
☆29 Updated 2 years ago
shengcanxu / canoSpeech
text to speech
☆10 Updated last year
yoongi43 / VRVQ
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11 Updated 3 months ago
impel-intelligence / dippy-speech-subnet
Dippy Synthetic Speech Subnet
☆16 Updated last month
lexkoro / StyleTTS
☆11 Updated 2 years ago
kyegomez / Audio-xLSTMs
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆18 Updated last week
ORI-Muchim / Grad-TTS
'Grad-TTS' with Multilingual Cleaners
☆10 Updated last year
shinhyeokoh / rwen
☆14 Updated 2 years ago
mbrotos / SoundSeg
Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation
☆13 Updated 7 months ago
ShoukanLabs / VoPho
A collection of all our phonemeizers for dataset construction and inference
☆24 Updated 4 months ago
reppy4620 / vocoders
My vocoder experiments
☆30 Updated this week
Tikai7 / DiTTO-TTS
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
☆28 Updated 5 months ago
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆12 Updated 7 months ago
Naozumi520 / g2pW-Cantonese
Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW
☆13 Updated 7 months ago