Neural network-based forced alignment with bidirectional attention mechanism
☆78Jan 17, 2025Updated last year
Alternatives and similar repositories for NeuFA
Users that are interested in NeuFA are comparing it to the libraries listed below
Sorting:
- Charsiu: A neural phonetic aligner.☆332Sep 19, 2022Updated 3 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- text to speech☆10Mar 19, 2024Updated last year
- ☆80Aug 8, 2025Updated 6 months ago
- Parallel waveform generation with DiffusionGAN☆17Mar 26, 2022Updated 3 years ago
- ☆45Oct 24, 2020Updated 5 years ago
- [SpeechCom Journal] Learning and controlling the source-filter representation of speech with a variational autoencoder☆45Apr 18, 2023Updated 2 years ago
- Official implementation of the source-filter HiFiGAN vocoder☆268Jul 29, 2023Updated 2 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15May 25, 2022Updated 3 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆125Jun 16, 2022Updated 3 years ago
- [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations☆142Apr 27, 2024Updated last year
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆150Aug 25, 2023Updated 2 years ago
- ☆24Mar 15, 2022Updated 3 years ago
- Chinese Text Normalization and Dataset☆91May 14, 2022Updated 3 years ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆184Mar 6, 2024Updated last year
- ☆55Aug 11, 2022Updated 3 years ago
- A Weakly Supervised Forced Alignment for disluent speech☆15Nov 12, 2023Updated 2 years ago
- Mason-Alberta Phonetic Segmenter☆15Feb 24, 2026Updated last week
- ☆16Apr 4, 2022Updated 3 years ago
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆367Sep 3, 2024Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement☆159Jul 16, 2022Updated 3 years ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆121Mar 14, 2023Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆223Oct 20, 2023Updated 2 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Nov 1, 2022Updated 3 years ago
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion☆104Jul 12, 2023Updated 2 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- ☆197May 3, 2024Updated last year
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago