Viterbi decoding in PyTorch
☆41Sep 10, 2025Updated 6 months ago
Alternatives and similar repositories for torbi
Users who are interested in torbi compare it to the libraries listed below
- Prosody and Pronunciation Modification Network☆63May 5, 2025Updated 10 months ago
- Full models and training code for PESTO☆76Jun 12, 2024Updated last year
- A robust pitch tracker using synchro-squeezed FFT and frequency-domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- High-Fidelity Neural Phonetic Posteriorgrams☆122Feb 23, 2025Updated last year
- ☆23Aug 4, 2025Updated 7 months ago
- Sequence alignment methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆28Apr 23, 2024Updated last year
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Apr 10, 2023Updated 2 years ago
- A neural vocoder supporting Ring Attention, Conformer, and NSF.☆24Aug 1, 2025Updated 7 months ago
- ☆67Aug 16, 2023Updated 2 years ago
- PyTorch implementation of the invertible CQT based on non-stationary Gabor filters☆36Jun 20, 2023Updated 2 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Simple inference for VITS2 TTS using ONNX Runtime and espeak-ng in C++☆18Apr 17, 2024Updated last year
- GPT-style network for phonemization with durations of text☆68Mar 21, 2024Updated 2 years ago
- Experiments from the paper "Sinusoidal Frequency Estimation by Gradient Descent"☆61Mar 8, 2023Updated 3 years ago
- Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decou…☆14May 6, 2025Updated 10 months ago
- A differentiable version of SPTK☆196Feb 26, 2026Updated 3 weeks ago
- PyTorch model for contextless phoneme prediction from speech audio☆32Oct 30, 2025Updated 4 months ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆38Oct 28, 2025Updated 4 months ago
- Pitch Estimating Neural Networks (PENN)☆271Apr 2, 2025Updated 11 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- An ODE-based generative neural vocoder using Rectified Flow☆58Apr 29, 2023Updated 2 years ago
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".☆22Jun 10, 2024Updated last year
- ☆12Nov 7, 2024Updated last year
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆147Aug 22, 2022Updated 3 years ago
- Reproducible Subjective Evaluation☆61Mar 3, 2024Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- Differentiable dynamic range controller in PyTorch.☆52Feb 10, 2026Updated last month
- PyTorch implementation of BigVSAN☆203Dec 9, 2025Updated 3 months ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆67Jan 7, 2023Updated 3 years ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- Adaptive Vocoder for Custom Voice☆61Sep 22, 2022Updated 3 years ago
- ☆19May 2, 2024Updated last year
- GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch☆136Feb 3, 2025Updated last year
- Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis (Meso-DTFA)☆21Jul 6, 2023Updated 2 years ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆20Feb 9, 2025Updated last year