PyTorch-based phoneme recognition (TIMIT phoneme classification)
☆35Apr 25, 2018Updated 8 years ago
Alternatives and similar repositories for PytorchSR
Users that are interested in PytorchSR are comparing it to the libraries listed below.
- Phoneme Recognition using RecNet☆97Nov 22, 2016Updated 9 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆47Jun 24, 2020Updated 5 years ago
- Tensorflow implementation of VQVAE for voice conversion☆12Apr 3, 2018Updated 8 years ago
- VQVAE for Unsupervised Voice Conversion☆21Apr 25, 2019Updated 7 years ago
- Build an attention-based model for speech recognition. Use the Word2vec model to help train the attention model.☆30Dec 18, 2019Updated 6 years ago
- Network specification and demo☆35Jun 5, 2017Updated 8 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆15Jun 4, 2019Updated 6 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- Phoneme recognition using various features and deep learning.☆13Nov 27, 2019Updated 6 years ago
- Voice conversion (VC) investigation using three variants of VAE☆59Oct 28, 2019Updated 6 years ago
- Python wrapper for Sinsy☆53Oct 9, 2023Updated 2 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆50Oct 19, 2017Updated 8 years ago
- Character level speech recognizer using ctc loss with deep rnns in TensorFlow.☆78Jun 9, 2018Updated 7 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- Small-footprint Keyword Spotting☆18Jul 28, 2019Updated 6 years ago
- Alignment examples for Interspeech 2024☆27Jul 5, 2024Updated last year
- SWIG bindings for Kaldi I/O, built with Conda☆15Dec 15, 2024Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- Quasi-Periodic Parallel WaveGAN Pytorch implementation☆46Oct 29, 2022Updated 3 years ago
- Spiking neural networks (SNNs) for speech classification☆12Mar 14, 2022Updated 4 years ago
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- ☆16Apr 4, 2022Updated 4 years ago
- ☆11Sep 29, 2020Updated 5 years ago
- An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…☆118May 27, 2021Updated 4 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- ☆20Jun 5, 2022Updated 3 years ago
- USC CS621 Course Project☆26Apr 22, 2023Updated 3 years ago
- Convolutional Spiking Neural Network to recognize speech utterances using Spike-Timing-Dependent Plasticity☆10Mar 9, 2021Updated 5 years ago
- ☆19Dec 8, 2020Updated 5 years ago
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Sep 17, 2019Updated 6 years ago
- ☆12May 12, 2016Updated 9 years ago
- ☆42Mar 25, 2022Updated 4 years ago
- ☆13Jul 10, 2025Updated 9 months ago
- Speech Commands Recognition using end-to-end deep learning models in pytorch☆28Oct 8, 2020Updated 5 years ago
- Python phase-vocoder implementation with pitch shifting and formant correction☆14Feb 17, 2022Updated 4 years ago
- Building an NN-CTC acoustic model with phoneme-level modeling☆15May 14, 2019Updated 6 years ago
- Voice Conversion using CycleGANs for Non-Parallel Data☆125Dec 18, 2018Updated 7 years ago
- Audio Keyword Search☆12May 5, 2019Updated 7 years ago