my-yy / s2v_rcView external linksLinks
Speech2Vec Reality Check
☆88Feb 21, 2023Updated 2 years ago
Alternatives and similar repositories for s2v_rc
Users that are interested in s2v_rc are comparing it to the libraries listed below
Sorting:
- ☆15Sep 9, 2021Updated 4 years ago
- Taiwanese Speech Synthesis with Tacotron2☆25Oct 2, 2022Updated 3 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- text to speech☆10Mar 19, 2024Updated last year
- Some PKGBUILDs☆12Aug 5, 2025Updated 6 months ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆15Oct 28, 2024Updated last year
- ☆13Sep 25, 2024Updated last year
- ☆10Apr 17, 2024Updated last year
- ☆31Jul 13, 2023Updated 2 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated last year
- Taiwanese Translation with BERT based model and RNN. Collection of Taiwanese text corpus☆13Oct 15, 2022Updated 3 years ago
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- Code:Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Mod…☆25Dec 17, 2019Updated 6 years ago
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆25Sep 26, 2023Updated 2 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- ☆13Mar 7, 2022Updated 3 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Sep 26, 2022Updated 3 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- fine-tune Whipser model for Taiwanese speech recognition☆36Mar 23, 2023Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Apr 11, 2020Updated 5 years ago
- A PyTorch implementation of the universal neural vocoder☆67Nov 6, 2020Updated 5 years ago
- ☆41May 15, 2023Updated 2 years ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- ☆100Jul 22, 2021Updated 4 years ago
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- ☆19Mar 22, 2024Updated last year
- some of my dotfiles☆19Dec 30, 2025Updated last month
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- g2p for english tts☆19Nov 10, 2022Updated 3 years ago
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25May 18, 2023Updated 2 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆117May 27, 2021Updated 4 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- ☆10Dec 16, 2018Updated 7 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- Super Flappy Bird in p5.js☆10Mar 8, 2021Updated 4 years ago