PyTorch-based phoneme recognition (TIMIT phoneme classification)
☆35Apr 25, 2018Updated 7 years ago
Alternatives and similar repositories for PytorchSR
Users that are interested in PytorchSR are comparing it to the libraries listed below.
- Phoneme Recognition using RecNet☆97Nov 22, 2016Updated 9 years ago
- Tensorflow implementation of VQVAE for voice conversion☆12Apr 3, 2018Updated 7 years ago
- VQVAE for Unsupervised Voice Conversion☆21Apr 25, 2019Updated 6 years ago
- Build an attention-based model for speech recognition. Use the Word2vec model to help train the attention model.☆29Dec 18, 2019Updated 6 years ago
- Network specification and demo☆35Jun 5, 2017Updated 8 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆15Jun 4, 2019Updated 6 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- Phoneme recognition using various features and deep learning.☆13Nov 27, 2019Updated 6 years ago
- Voice conversion (VC) investigation using three variants of VAE☆59Oct 28, 2019Updated 6 years ago
- Python wrapper for Sinsy☆53Oct 9, 2023Updated 2 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆50Oct 19, 2017Updated 8 years ago
- Character level speech recognizer using ctc loss with deep rnns in TensorFlow.☆78Jun 9, 2018Updated 7 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- Small-footprint Keyword Spotting☆18Jul 28, 2019Updated 6 years ago
- Alignment examples for Interspeech 2024☆27Jul 5, 2024Updated last year
- SWIG bindings for Kaldi I/O, built with Conda☆15Dec 15, 2024Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Spiking neural networks (SNNs) for speech classification☆12Mar 14, 2022Updated 4 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- Quasi-Periodic Parallel WaveGAN Pytorch implementation☆46Oct 29, 2022Updated 3 years ago
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- ☆11Sep 29, 2020Updated 5 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆117May 27, 2021Updated 4 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- ☆20Jun 5, 2022Updated 3 years ago
- USC CS621 Course Project☆26Apr 22, 2023Updated 2 years ago
- ☆19Dec 8, 2020Updated 5 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Sep 17, 2019Updated 6 years ago
- ☆12May 12, 2016Updated 9 years ago
- ☆42Mar 25, 2022Updated 4 years ago
- ☆13Jul 10, 2025Updated 8 months ago
- Speech Commands Recognition using end-to-end deep learning models in pytorch☆28Oct 8, 2020Updated 5 years ago
- Python phase-vocoder implementation with pitch shifting and formant correction☆14Feb 17, 2022Updated 4 years ago
- kubectl-cred is a Kubernetes plugin for switching between contexts, namespaces, and clusters using an interactive CLI.☆15Feb 28, 2022Updated 4 years ago
- Building an NN-CTC acoustic model with phoneme-level modeling☆15May 14, 2019Updated 6 years ago
- Voice Conversion using CycleGANs for Non-Parallel Data☆125Dec 18, 2018Updated 7 years ago
- Audio Keyword Search☆12May 5, 2019Updated 6 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆83Nov 13, 2021Updated 4 years ago