dafyddg/RFA

Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectrum and spectrogram.

☆17

Alternatives and similar repositories for RFA

Users that are interested in RFA are comparing it to the libraries listed below.

pabarbosa / prosody-scripts
☆16 · Jan 20, 2026 · Updated 6 months ago
nc-ai / speech
☆17 · Aug 27, 2025 · Updated 10 months ago
PlayVoice / VI-Speaker
Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the speaker ID to implement voice cloning.
☆30 · Sep 16, 2022 · Updated 3 years ago
idiap / zff_vad
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆23 · Oct 19, 2023 · Updated 2 years ago
zzw922cn / wesinger2
Synthesized singing voice demos for the WeSinger 2 paper.
☆26 · Feb 20, 2023 · Updated 3 years ago
rhoposit / icassp2021
☆15 · May 8, 2021 · Updated 5 years ago
rishikksh20 / Phone-Level-Mixture-Density-Network-for-TTS
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
☆45 · Dec 1, 2021 · Updated 4 years ago
feelins / Python-Data-Innovation
Some basic scripts for linguistics
☆13 · May 15, 2025 · Updated last year
MiscellaneousStuff / PhoneLM
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48 · Sep 4, 2023 · Updated 2 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
Project for "Hiding Speaker's Sex in Speech Using Zero-Evidence Speaker Representation in an Analysis/Synthesis Pipeline"
☆15 · Nov 30, 2022 · Updated 3 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with PyTorch [kaldi]
☆12 · Jun 5, 2020 · Updated 6 years ago
ajinkyakulkarni14 / ERISHA
ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44 · Dec 17, 2020 · Updated 5 years ago
sarulab-speech / multi-speaker-dgp
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24 · Mar 23, 2021 · Updated 5 years ago
vliu15 / adversarial-tts
End-to-end Text-to-Speech with Generative Adversarial Networks
☆20 · Feb 6, 2021 · Updated 5 years ago
cschaefer26 / StyleMelGAN
☆10 · Apr 8, 2024 · Updated 2 years ago
Hannes1 / react-native-wenet
WeNet speech-to-text for React Native
☆10 · Nov 1, 2022 · Updated 3 years ago
k2-fsa / multi_quantization
☆46 · Nov 2, 2023 · Updated 2 years ago
shengcanxu / canoSpeech
Text to speech
☆10 · Mar 19, 2024 · Updated 2 years ago
ga642381 / RobustVC
**ICASSP 2022** "Toward Degradation-Robust Voice Conversion": using speech enhancement and end-to-end denoising training to improve degrada…
☆24 · Sep 27, 2022 · Updated 3 years ago
spring-media / DeepForcedAligner
☆81 · Aug 8, 2025 · Updated 11 months ago
xinjli / alqalign
Multilingual speech aligner
☆78 · Nov 19, 2023 · Updated 2 years ago
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16 · Apr 13, 2022 · Updated 4 years ago
LEEYOONHYUNG / GraphTTS
☆12 · Jul 6, 2023 · Updated 3 years ago
ropensci / phonfieldwork
R package for phonetic research and experimentation
☆20 · Jul 29, 2024 · Updated last year
FantSun / Speechflow
Speechflow for emotion recognition related information decomposition
☆10 · Jul 27, 2021 · Updated 4 years ago
PanagiotisP / svs-multiband
Code for the paper "Multi-band Masking for Waveform-based Singing Voice Separation", accepted at EUSIPCO 2022
☆15 · Jun 18, 2022 · Updated 4 years ago
revsic / torch-nansy
Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513
☆64 · Feb 13, 2023 · Updated 3 years ago
samsad35 / source-filter-vae
[SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder
☆46 · Apr 18, 2023 · Updated 3 years ago
CODEJIN / VITS_Diffusion
☆26 · Sep 22, 2022 · Updated 3 years ago
sarulab-speech / lightweight_spkr_anon
Lightweight speaker anonymization [IEEE SLT 2021]
☆27 · Jun 6, 2022 · Updated 4 years ago
yl4579 / PitchExtractor
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
☆151 · Aug 22, 2022 · Updated 3 years ago
keonlee9420 / FastPitchFormant
PyTorch Implementation of NCSOFT's FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis
☆74 · Aug 3, 2021 · Updated 4 years ago
jinny1208 / All-About-Speech
☆14 · Apr 2, 2023 · Updated 3 years ago
NingMiao / InteL-VAEs
Code for the paper "InteL-VAEs: Adding Inductive Biases to Variational Auto-Encoders via Intermediary Latents".
☆18 · Jun 25, 2021 · Updated 5 years ago
bagustris / s3prl-ser
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15 · Feb 28, 2026 · Updated 4 months ago
google-deepmind / librispeech-long
LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation …
☆99 · Dec 28, 2024 · Updated last year
CMsmartvoice / Unet-TTS
One-shot TTS with Improved Unseen Speaker and Style Transfer
☆37 · Mar 2, 2022 · Updated 4 years ago
jlian2 / Robust-Voice-Style-Transfer
Demo for ICASSP 2022
☆64 · Jun 14, 2022 · Updated 4 years ago
CCTN-BCI / Neural2Speech
Code and speech demo for speech reconstruction from ECoG recordings
☆12 · May 21, 2025 · Updated last year