silversparro/wav2letter.pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/silversparro/wav2letter.pytorch)

silversparro / wav2letter.pytorch

A fully convolution-network for speech-to-text, built on pytorch.

☆126

Alternatives and similar repositories for wav2letter.pytorch

Users that are interested in wav2letter.pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

type-a / speechnet
View on GitHub
Automatic Speech Recognition
☆20Aug 24, 2018Updated 7 years ago
talonvoice / wav2train
View on GitHub
automatically align transcribed audio and generate a wav2letter training corpus
☆36Apr 11, 2023Updated 3 years ago
LearnedVector / Wav2Letter
View on GitHub
Speech Recognition model based off of FAIR research paper built using Pytorch.
☆87Dec 11, 2018Updated 7 years ago
talonvoice / wav2letter
View on GitHub
Facebook AI Research Automatic Speech Recognition Toolkit
☆23Mar 13, 2021Updated 5 years ago
ttaoREtw / semi-tts
View on GitHub
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39Jul 16, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ryanleary / patter
View on GitHub
speech-to-text in pytorch
☆83Mar 14, 2019Updated 7 years ago
1ytic / open_stt_e2e
View on GitHub
PyTorch end-to-end speech recognition
☆50Dec 30, 2020Updated 5 years ago
juliuskunze / speechless
View on GitHub
Speech-to-text based on wav2letter built for transfer learning
☆98Oct 21, 2022Updated 3 years ago
awni / speech
View on GitHub
A PyTorch Implementation of End-to-End Models for Speech-to-Text
☆768Jul 6, 2023Updated 3 years ago
kyama0321 / gammachirpy
View on GitHub
A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)
☆32May 14, 2024Updated 2 years ago
Kyubyong / specAugment
View on GitHub
Tensor2tensor experiment with SpecAugment
☆46May 13, 2019Updated 7 years ago
HawkAaron / E2E-ASR
View on GitHub
PyTorch Implementations for End-to-End Automatic Speech Recognition
☆127Jun 10, 2019Updated 7 years ago
Deepest-Project / Transformer-TTS
View on GitHub
Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
☆64Jul 6, 2023Updated 3 years ago
SeanNaren / deepspeech.pytorch
View on GitHub
Speech Recognition using DeepSpeech2.
☆2,136Dec 13, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
talonvoice / speech
View on GitHub
speech engine training projects
☆29Apr 19, 2021Updated 5 years ago
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
mdangschat / ctc-asr
View on GitHub
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
☆123Apr 15, 2020Updated 6 years ago
AIRI-Institute / AI4TALK
View on GitHub
☆13Dec 7, 2022Updated 3 years ago
joaoantoniocn / AM-MobileNet1D
View on GitHub
The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…
☆31Oct 3, 2023Updated 2 years ago
ctogle / abae_pytorch
View on GitHub
Attention based aspect extraction via pytorch
☆14Jun 8, 2020Updated 6 years ago
louiskirsch / speechT
View on GitHub
An opensource speech-to-text software written in tensorflow
☆160Oct 15, 2022Updated 3 years ago
CoEDL / kaldi_helpers
View on GitHub
A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
☆15May 19, 2020Updated 6 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
dynilib / dynitag
View on GitHub
Collaborative audio annotation tool
☆17Sep 16, 2022Updated 3 years ago
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
wiebket / bt4vt
View on GitHub
Bias Tests for Voice Technologies (bt4vt)
☆11Jun 16, 2024Updated 2 years ago
AzizCode92 / Listen-Attend-and-Spell-Pytorch
View on GitHub
Listen Attend and Spell (LAS) implement in pytorch
☆60Sep 4, 2018Updated 7 years ago
jinserk / pytorch-asr
View on GitHub
ASR with PyTorch
☆139Mar 10, 2019Updated 7 years ago
ruslan-corpus / ruslan-corpus.github.io
View on GitHub
☆22Aug 29, 2019Updated 6 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
funcwj / ge2e-speaker-verification
View on GitHub
Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"
☆103Mar 18, 2019Updated 7 years ago
farisalasmary / deepspeech2-online-decoder
View on GitHub
Online (real-time) decoder to be used with DeepSpeech2 model
☆25Feb 27, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ICLR-DAP / Deep-Audio-Prior
View on GitHub
Anonymous ICLR Submission
☆14Sep 25, 2019Updated 6 years ago
rgzn-aiyun / melgan-cpu
View on GitHub
Real-time melgan based on cpu ！！！
☆13Dec 3, 2019Updated 6 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
jaywalnut310 / waveglow-vqvae
View on GitHub
WaveGlow vocoder with VQVAE
☆61Jun 18, 2019Updated 7 years ago
vivianngo97 / Punctuation_Transcription
View on GitHub
A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.
☆15Aug 6, 2020Updated 5 years ago
freewym / espresso
View on GitHub
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆939Sep 4, 2024Updated last year
lallubharteja / KWS-Scripts
View on GitHub
Keyword Search Recipe for Subword ASR
☆30Jul 12, 2019Updated 7 years ago