ZackHodari/discrete_intonation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZackHodari/discrete_intonation)

ZackHodari / discrete_intonation

Code for paper titled "Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0" submitted to Speech Prosody

☆17

Alternatives and similar repositories for discrete_intonation

Users that are interested in discrete_intonation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ChangLabUcsf / intonatang
View on GitHub
Analysis code for speech intonation project
☆14Feb 19, 2019Updated 7 years ago
ZackHodari / average_prosody
View on GitHub
Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…
☆24Dec 8, 2019Updated 6 years ago
gerazov / PySFC
View on GitHub
Python implementation of the SFC intonation model.
☆18Nov 29, 2017Updated 8 years ago
qqueing / pytorch-G2P
View on GitHub
(semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean
☆23Dec 17, 2017Updated 8 years ago
choiHkk / Transformer-TTS-V2
View on GitHub
☆25Mar 6, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
zhitko / inton-trainer
View on GitHub
Inton Trainer is designed for learning the intonation of oral speech.
☆13Jun 27, 2026Updated last month
keonlee9420 / Daft-Exprt
View on GitHub
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
☆55Oct 15, 2021Updated 4 years ago
shang0712 / HierTTS
View on GitHub
☆47Apr 16, 2023Updated 3 years ago
seungheondoh / speech-to-music
View on GitHub
Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]
☆17Aug 16, 2023Updated 2 years ago
Yoshifumi-Nakano / visual-text-to-speech
View on GitHub
visual-text to speech
☆14Apr 3, 2022Updated 4 years ago
yunyikristy / ttsGAN-ICLR2019
View on GitHub
☆25Apr 24, 2019Updated 7 years ago
freds0 / CML-TTS-Dataset
View on GitHub
CML-TTS: A Multilingual Dataset for Speech Synthesis
☆36Jul 31, 2024Updated last year
adasegroup / OSM-one-shot-multispeaker
View on GitHub
Framework for one-shot multispeaker system based on Deep Learning
☆19May 30, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
rosrad / asvspoof2017
View on GitHub
some scripts for asvspoof2017
☆11Dec 27, 2018Updated 7 years ago
jzmzhong / Automatic-Prosody-Annotator-with-SSWP-CLAP
View on GitHub
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
☆50Jun 11, 2024Updated 2 years ago
keonlee9420 / FastPitchFormant
View on GitHub
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
☆74Aug 3, 2021Updated 4 years ago
CODEJIN / VITS_Diffusion
View on GitHub
☆26Sep 22, 2022Updated 3 years ago
dhgrs / pytorch-UniWaveNet
View on GitHub
☆31Nov 7, 2018Updated 7 years ago
juanmvsa / CursoPLN-UNAM
View on GitHub
Material usado durante el curso "Introducción al Procesamiento Natural con Python" del Grupo de Ingeniería Lingüistica de la UNAM.
☆17Apr 11, 2022Updated 4 years ago
ttslr / StrengthNet
View on GitHub
[INTERSPEECH'2022] Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
☆83Nov 4, 2022Updated 3 years ago
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
AI-S2-Lab / FluentEditor
View on GitHub
[InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency
☆62Oct 23, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
cnlinxi / tpse_tacotron2
View on GitHub
TPSE-GST Tacotron2
☆14May 1, 2019Updated 7 years ago
Edresson / ZS-TTS-Evaluation
View on GitHub
☆45Sep 19, 2024Updated last year
rishikksh20 / Phone-Level-Mixture-Density-Network-for-TTS
View on GitHub
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
☆45Dec 1, 2021Updated 4 years ago
yl4579 / PL-BERT
View on GitHub
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
☆270Jan 13, 2025Updated last year
amazon-science / proteno
View on GitHub
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45May 25, 2021Updated 5 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
BiSinger-SVS / BiSinger
View on GitHub
Bilingual Singing Voice Synthesis
☆18Mar 25, 2024Updated 2 years ago
hash2430 / pitchtron
View on GitHub
TTS for pitch-accented language. Korean dialect DB.
☆155May 12, 2023Updated 3 years ago
AI-S2-Lab / GPT-Talker
View on GitHub
[ACMMM'2024] Generative Expressive Conversational Speech Synthesis
☆45Oct 28, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
pilarOG / prosodic-analysis
View on GitHub
Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice quality
☆23Jun 17, 2019Updated 7 years ago
bronichern / DeepFry
View on GitHub
☆13Jun 29, 2025Updated last year
candlewill / RawNet
View on GitHub
RawNet: Fast End-to-End Neural Vocoder
☆43May 29, 2019Updated 7 years ago
nii-yamagishilab / self-attention-tacotron
View on GitHub
An implementation of "Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language" …
☆114Jun 19, 2020Updated 6 years ago
keonlee9420 / Comprehensive-Tacotron2
View on GitHub
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…
☆49Jul 31, 2023Updated 2 years ago
CSTR-Edinburgh / ophelia
View on GitHub
Sequence-to-sequence TTS based on Kyubyong's dc_tts
☆61Feb 2, 2023Updated 3 years ago
hcy71o / TransferTTS
View on GitHub
TransferTTS (Zero-Shot learning of VITS)
☆102Sep 23, 2022Updated 3 years ago