AppleHolic/PytorchSR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AppleHolic/PytorchSR)

AppleHolic / PytorchSR

Pytorch based phoneme recognition (TIMIT phoneme classification)

☆35

Alternatives and similar repositories for PytorchSR

Users that are interested in PytorchSR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JoergFranke / phoneme_recognition
View on GitHub
Phoneme Recognition using RecNet
☆97Nov 22, 2016Updated 9 years ago
tbornt / phoneme_ctc
View on GitHub
Bidirectional dynamic RNN + CTC for phoneme recognition
☆47Jun 24, 2020Updated 6 years ago
twidddj / vqvae
View on GitHub
Tensorflow implementation of VQVAE for voice conversion
☆12Apr 3, 2018Updated 8 years ago
Suhee05 / Zerospeech2019
View on GitHub
VQVAE for Unsupervised Voice Conversion
☆21Apr 25, 2019Updated 7 years ago
fengxin-bupt / Application-of-Word2vec-in-Phoneme-Recognition
View on GitHub
Build an attention-based model for speech recogntion.Use the Word2vec model to help to train the attention model.
☆30Dec 18, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
JeremyCCHsu / vc-vawgan
View on GitHub
Network specification and demo
☆35Jun 5, 2017Updated 9 years ago
anjandeepsahni / speech_phoneme_prediction
View on GitHub
Phoneme prediction from speech mel-spectrograms using RNN.
☆15Jun 4, 2019Updated 7 years ago
uark-cviu / Right2Talk
View on GitHub
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20Aug 2, 2021Updated 4 years ago
HanSeokhyeon / Deep_learning_for_Phoneme_recognition
View on GitHub
다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.
☆13Nov 27, 2019Updated 6 years ago
vsimkus / vae-voice-conversion
View on GitHub
Voice conversion (VC) investigation using three variants of VAE
☆59Oct 28, 2019Updated 6 years ago
r9y9 / pysinsy
View on GitHub
Python wrapper for Sinsy
☆53Oct 9, 2023Updated 2 years ago
chenllliang / CTDNN
View on GitHub
MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition
☆11Dec 4, 2021Updated 4 years ago
weedwind / CTC-speech-recognition
View on GitHub
This is a working example of using CTC for phone recognition on TIMIT
☆50Oct 19, 2017Updated 8 years ago
domerin0 / rnn-speech
View on GitHub
Character level speech recognizer using ctc loss with deep rnns in TensorFlow.
☆78Jun 9, 2018Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Mmiglio / SpeechRecognition
View on GitHub
Small-footprint Keyword Spotting
☆18Jul 28, 2019Updated 6 years ago
Auroraaa86 / LCS-CTC
View on GitHub
For IEEE ASRU(2025)
☆15Jun 21, 2025Updated last year
hyama5 / vae_align
View on GitHub
Alignment examples for Interspeech 2024
☆28Jul 5, 2024Updated 2 years ago
sdrobert / pydrobert-kaldi
View on GitHub
SWIG bindings for Kaldi I/O, built with Conda
☆15Dec 15, 2024Updated last year
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
bigpon / QPPWG
View on GitHub
Quasi-Periodic Parallel WaveGAN Pytorch implementation
☆46Oct 29, 2022Updated 3 years ago
npuichigo / extract_features_using_world
View on GitHub
using world vocoder to extract features and make data for training neural networks
☆11Oct 9, 2017Updated 8 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
View on GitHub
☆16Apr 4, 2022Updated 4 years ago
cjerry1243 / TransferLearning-CLVC
View on GitHub
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion
☆40Oct 22, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
NeelayS / speech_spike_signatures
View on GitHub
Spiking neural networks (SNNs) for speech classification
☆12Mar 14, 2022Updated 4 years ago
ihp-lab / Speaker-Invariant-Domain-Adversarial-Neural-Networks
View on GitHub
☆11Sep 29, 2020Updated 5 years ago
cyhuang-tw / AdaIN-VC
View on GitHub
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Norm…
☆119May 27, 2021Updated 5 years ago
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
chomeyama / UnifiedSourceFilterGAN
View on GitHub
☆20Jun 5, 2022Updated 4 years ago
taylorlu / AudioKWS
View on GitHub
Audio Keyword Search
☆12May 5, 2019Updated 7 years ago
fantasy-fish / Generalized-Speech-Animation
View on GitHub
USC CS621 Course Project
☆26Apr 22, 2023Updated 3 years ago
verrannt / snn_speechrec
View on GitHub
Convolutional Spiking Neural Network to recognize speech utterances using Spike-Timing-Dependent Plasticity
☆10Mar 9, 2021Updated 5 years ago
zw76859420 / ASR_Phone
View on GitHub
以音素建模构建NN-CTC声学模型
☆16May 14, 2019Updated 7 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
stoneMo / ASVspoof
View on GitHub
☆19Dec 8, 2020Updated 5 years ago
shaojinding / GroupLatentEmbedding
View on GitHub
Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…
☆28Sep 17, 2019Updated 6 years ago
Magicboomliu / Viseme-Classification
View on GitHub
A pipeline from Dataset Gathering,Data annotations, Model training,Model Evaluation for viseme (visual sound phoneme) classification
☆15Jan 19, 2021Updated 5 years ago
jefflai108 / Unsupervised-TTS
View on GitHub
☆42Mar 25, 2022Updated 4 years ago
255BITS / vocal-autoencoder
View on GitHub
☆12May 12, 2016Updated 10 years ago
MurakumoIndustries / murakumoindustries.github.io
View on GitHub
☆13Jul 10, 2025Updated last year
jarfo / gcommands
View on GitHub
Speech Commands Recognition using end-to-end deep learning models in pytorch
☆28Oct 8, 2020Updated 5 years ago