vadimkantorov / convasr

Baseline convolutional ASR system in PyTorch

☆21

Alternatives and similar repositories for convasr

Users who are interested in convasr are comparing it to the libraries listed below.

ruslan-corpus / ruslan-corpus.github.io
☆21Updated 5 years ago
1ytic / open_stt_e2e
PyTorch end-to-end speech recognition
☆49Updated 4 years ago
Idlak / Living-Audio-Dataset
A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …
☆41Updated 3 years ago
Kyubyong / specAugment
Tensor2tensor experiment with SpecAugment
☆46Updated 6 years ago
anyks / alm
Smart Language Model
☆46Updated 2 years ago
nsu-ai-team / russian_g2p_neuro
Experiments with grapheme2phoneme for Russian based on artificial neural networks
☆20Updated 4 years ago
AIRI-Institute / AI4TALK
☆13Updated 2 years ago
artbataev / end2end
Losses and decoders for end-to-end ASR and OCR
☆34Updated 4 years ago
alxmamaev / ultimate_tts
☆13Updated 3 years ago
EMRAI / emrai-synthetic-diarization-corpus
☆20Updated 6 years ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
☆12Updated 4 years ago
pilot7747 / VoxDIY
This repository provides data and code for the paper "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription".
☆16Updated 4 years ago
m-wiesner / nnet_pytorch
Kaldi-style neural network training in PyTorch for use in place of nnet3 in Kaldi.
☆26Updated last year
revdotcom / words2num
Convert words to numbers
☆20Updated 3 years ago
isca-sig-rosp / ISCA-SIG-RoSP
Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)
☆11Updated last year
jfainberg / lattice_combination
Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices
☆16Updated last year
bsxfan / meta-embeddings
Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
☆22Updated 6 years ago
joaoantoniocn / AM-MobileNet1D
The Additive Margin MobileNet1D is a new lightweight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…
☆30Updated last year
burrmill / burrmill
BurrMill core
☆21Updated 3 years ago
beer-asr / beer
Bayesian spEEch Recognizer
☆55Updated 4 years ago
shane-settle / neural-acoustic-word-embeddings
☆45Updated 6 years ago
kate-egorova / ASR-hybrid-decoding
This is an extension of the Kaldi speech recognition software that enables decoding of speech with hybrid word and phoneme graphs.…
☆11Updated 5 years ago
asappresearch / multistream-cnn
Multistream CNN for Robust Acoustic Modeling
☆40Updated 4 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode accurately and quickly.
☆16Updated 3 years ago
markusdr / transducersaurus
Automatically exported from code.google.com/p/transducersaurus
☆11Updated 10 years ago
mmaciej2 / kaldi
This is now the official location of the Kaldi project.
☆13Updated 6 years ago
levtelyatnikov / radiomixer
radiomixer
☆14Updated 3 years ago
yoyolicoris / pytorch_FFTNet
A PyTorch implementation of FFTNet.
☆37Updated 6 years ago
besacier / ASR2022
☆56Updated 2 years ago
zh217 / torch-asg
Auto Segmentation Criterion (ASG) implemented in PyTorch
☆51Updated 3 years ago