ttslr/MonTTS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ttslr/MonTTS)

ttslr / MonTTS

☆16

Alternatives and similar repositories for MonTTS

Users that are interested in MonTTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
cyhuang-tw / robust-vc
View on GitHub
☆11May 7, 2022Updated 4 years ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
v-nhandt21 / MusicVoiceConversion
View on GitHub
Sing any popular song with your voice
☆11Jul 10, 2022Updated 4 years ago
hardmstar / BhaDistanceBasedZCRofGaussian
View on GitHub
calculate bhattacharyya distance based on zero cross rate feature between different Gaussian model for speech emotion recognition. corpus…
☆11Oct 17, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wentaozhu / speechnas
View on GitHub
SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification
☆30Mar 24, 2023Updated 3 years ago
wenet-e2e / wecut
View on GitHub
video cut powered by AI
☆23Nov 15, 2022Updated 3 years ago
rishikksh20 / Zero-Shot-TTS
View on GitHub
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
☆34Sep 24, 2021Updated 4 years ago
danpovey / quantization
View on GitHub
Torch-based tool for quantizing high-dimensional vectors using additive codebooks
☆54May 25, 2022Updated 4 years ago
MiniXC / LightningFastSpeech2
View on GitHub
☆55Jan 13, 2023Updated 3 years ago
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
Neclow / SERAB
View on GitHub
SERAB: a multi-lingual benchmark for speech emotion recognition
☆28Dec 16, 2022Updated 3 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
fun-audio-llm / fun-audio-llm.github.io
View on GitHub
FunAudioLLM homepage
☆17Dec 11, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
zzw922cn / wesinger2
View on GitHub
Synthesized singing voice demos of WeSinger 2 paper.
☆26Feb 20, 2023Updated 3 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
enod / mongolian-bert-ner
View on GitHub
Pytorch-Named-Entity-Recognition-with-BERT
☆15Oct 31, 2020Updated 5 years ago
hcy71o / SNAC
View on GitHub
Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…
☆57Aug 7, 2023Updated 2 years ago
2prime / OpenBlackBox
View on GitHub
☆12Nov 5, 2019Updated 6 years ago
vliu15 / adversarial-tts
View on GitHub
End-to-end Text-to-Speech with Generative Adversarial Networks
☆20Feb 6, 2021Updated 5 years ago
anzeyimana / kinyabert-acl2022
View on GitHub
☆19Feb 4, 2024Updated 2 years ago
fabianoluzbr / neural-g2p-portuguese
View on GitHub
Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…
☆19Jun 14, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
rishikksh20 / Phone-Level-Mixture-Density-Network-for-TTS
View on GitHub
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
☆45Dec 1, 2021Updated 4 years ago
dcaulley / av_diarization
View on GitHub
AudioVisual Diarization - Supervised and Unsupervised
☆15Nov 22, 2022Updated 3 years ago
yluo42 / SRVQ
View on GitHub
Spherical residual vector quantization (SRVQ)
☆31Aug 25, 2024Updated last year
bayartsogt-ya / albert-mongolian
View on GitHub
ALBERT trained on Mongolian text corpus
☆19Jan 10, 2021Updated 5 years ago
zjlww / dsp
View on GitHub
Digital Speech Processing in PyTorch.
☆15Aug 12, 2022Updated 3 years ago
tonnetonne814 / unofficial-vits2-44100-Ja
View on GitHub
44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。
☆24Sep 1, 2023Updated 2 years ago
ljuvela / GELP
View on GitHub
☆27Apr 21, 2021Updated 5 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
View on GitHub
☆16Apr 4, 2022Updated 4 years ago
thuhcsi / mm2022-conversational-tts
View on GitHub
☆11May 9, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ttslr / python-MCD
View on GitHub
☆49May 3, 2020Updated 6 years ago
polvanrijn / VoiceMe
View on GitHub
Repository for the paper: VoiceMe: Personalized voice generation in TTS
☆125Apr 29, 2022Updated 4 years ago
LeBenchmark / Interspeech2021
View on GitHub
This repository describes our reproducible framework for assessing self-supervised representation learning from speech
☆52Oct 8, 2021Updated 4 years ago
vkothapally / Subband-Beamformer
View on GitHub
☆34Nov 29, 2022Updated 3 years ago
keonlee9420 / VAENAR-TTS
View on GitHub
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
☆74Aug 3, 2021Updated 4 years ago
chomeyama / HN-UnifiedSourceFilterGAN
View on GitHub
☆88Nov 1, 2022Updated 3 years ago
keonlee9420 / FastPitchFormant
View on GitHub
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
☆74Aug 3, 2021Updated 4 years ago