Pytorch based phoneme recognition (TIMIT phoneme classification)
☆35Apr 25, 2018Updated 7 years ago
Alternatives and similar repositories for PytorchSR
Users that are interested in PytorchSR are comparing it to the libraries listed below
Sorting:
- Bidirectional dynamic RNN + CTC for phoneme recognition☆46Jun 24, 2020Updated 5 years ago
- Phoneme Recognition using RecNet☆97Nov 22, 2016Updated 9 years ago
- Tensorflow implementation of VQVAE for voice conversion☆12Apr 3, 2018Updated 7 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- Build an attention-based model for speech recogntion.Use the Word2vec model to help to train the attention model.☆29Dec 18, 2019Updated 6 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆15Jun 4, 2019Updated 6 years ago
- Network specification and demo☆35Jun 5, 2017Updated 8 years ago
- Voice conversion (VC) investigation using three variants of VAE☆59Oct 28, 2019Updated 6 years ago
- VQVAE for Unsupervised Voice Conversion☆21Apr 25, 2019Updated 6 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Python wrapper for Sinsy☆53Oct 9, 2023Updated 2 years ago
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- 다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.☆14Nov 27, 2019Updated 6 years ago
- Speech to text library for Rhasspy using Kaldi☆15Dec 9, 2023Updated 2 years ago
- Audio Keyword Search☆12May 5, 2019Updated 6 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- ☆22Jul 8, 2019Updated 6 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- SWIG bindings for Kaldi I/O, built with Conda☆15Dec 15, 2024Updated last year
- ☆42Mar 25, 2022Updated 3 years ago
- Voice Alignment and Conversion with Neural Networks and the WORLD codec.☆20Apr 27, 2019Updated 6 years ago
- ☆20Jun 5, 2022Updated 3 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- The code for aishell-3 baseline acoustic model☆69Nov 30, 2020Updated 5 years ago
- Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignme…☆59Mar 9, 2020Updated 5 years ago
- Voice Conversion using Cycle GAN's For Non-Parallel Data☆125Dec 18, 2018Updated 7 years ago
- Small-footprint Keyword Spotting☆18Jul 28, 2019Updated 6 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Speech recognition on the TIMIT (or any other) dataset☆44Nov 2, 2017Updated 8 years ago
- Phone generation model/VAE/GAN/VAE+GAN☆20Jun 26, 2018Updated 7 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Dec 19, 2018Updated 7 years ago
- Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow☆17Jan 19, 2018Updated 8 years ago
- Character level speech recognizer using ctc loss with deep rnns in TensorFlow.☆78Jun 9, 2018Updated 7 years ago
- ☆18Aug 9, 2018Updated 7 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Jan 30, 2019Updated 7 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆83Nov 13, 2021Updated 4 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆87Dec 20, 2022Updated 3 years ago
- A system works on singing voice synthesis☆79Jan 11, 2023Updated 3 years ago