MarkWuNLP/SemanticMask

The repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition".

☆39

Alternatives and similar repositories for SemanticMask

Users who are interested in SemanticMask are comparing it to the libraries listed below.

cywang97 / StreamingTransformer
☆277 · Jan 15, 2021 · Updated 5 years ago
HawkAaron / E2E-ASR
PyTorch Implementations for End-to-End Automatic Speech Recognition
☆127 · Jun 10, 2019 · Updated 7 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12 · Mar 23, 2021 · Updated 5 years ago
athena-team / athena-decoder
☆76 · Mar 18, 2022 · Updated 4 years ago
qiujiali / lattice-rescore
☆16 · Jun 13, 2022 · Updated 4 years ago
oshindow / Transformer-Transducer
A pytorch_lightning reimplementation of the Transducer module from ESPnet.
☆78 · Mar 11, 2021 · Updated 5 years ago
ImperialNLP / MMT-Delib
☆10 · Dec 21, 2022 · Updated 3 years ago
cornerfarmer / ctc_segmentation
Segment a given audio into utterances using a trained end-to-end ASR model.
☆75 · Oct 9, 2020 · Updated 5 years ago
eastonYi / Unsupervised-ASR
unsupervised ASR (mainly phone classifier) using EODM and GAN
☆12 · Oct 22, 2020 · Updated 5 years ago
HaoranMiao / streaming-attention
streaming attention networks for end-to-end automatic speech recognition
☆56 · May 6, 2020 · Updated 6 years ago
ShigekiKarita / espnet-semi-supervised
ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…
☆38 · Feb 13, 2020 · Updated 6 years ago
idiap / apam
APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…
☆14 · Feb 15, 2021 · Updated 5 years ago
mt-upc / iwslt-2021
Systems submitted to IWSLT 2021 by the MT-UPC group.
☆14 · Feb 23, 2023 · Updated 3 years ago
thu-spmi / CAT
CAT is more than a CRF-based ASR toolkit: it provides a complete workflow for data-efficient end-to-end ASR, supporting CTC, CTC-CRF, RNN…
☆368 · Feb 5, 2026 · Updated 5 months ago
b-sigpro / spectral-feature-compression
The source code for Input-Adaptive Spectral Feature Compression by Sequence Modeling for Source Separation published in IEEE TASLPRO.
☆18 · Jun 3, 2026 · Updated last month
iamjanvijay / rnnt_decoder_cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
☆67 · Jan 7, 2026 · Updated 6 months ago
shanguanma / Aligners
HMM, CTC, RNN-Transducer, forward-backward algorithm
☆20 · Sep 5, 2023 · Updated 2 years ago
joshua-decoder / fisher-callhome-corpus
The Fisher and CALLHOME Spanish–English Speech Translation Corpus
☆41 · Feb 10, 2022 · Updated 4 years ago
BUTSpeechFIT / ASR-hybrid-decoding
☆17 · Nov 25, 2019 · Updated 6 years ago
jctian98 / e2e_lfmmi
E2E system with LF-MMI; word N-gram for Mandarin
☆167 · Apr 29, 2022 · Updated 4 years ago
YiwenShaoStephen / pychain
PyTorch implementation of LF-MMI for End-to-end ASR
☆221 · Jan 14, 2021 · Updated 5 years ago
hirofumi0810 / neural_sp
End-to-end ASR/LM implementation with PyTorch
☆594 · Aug 30, 2021 · Updated 4 years ago
eastonYi / end-to-end_asr_pytorch
Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch
☆23 · Jul 28, 2020 · Updated 5 years ago
YiwenShaoStephen / pychain_example
☆48 · Jan 8, 2021 · Updated 5 years ago
HawkAaron / RNN-Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
☆140 · Jun 7, 2021 · Updated 5 years ago
ZhengkunTian / OpenTransformer
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
☆378 · Jul 21, 2022 · Updated 4 years ago
HawkAaron / warp-transducer
A fast parallel implementation of RNN Transducer.
☆314 · Jun 7, 2023 · Updated 3 years ago
1ytic / warp-rnnt
CUDA-Warp RNN-Transducer
☆216 · Feb 22, 2023 · Updated 3 years ago
gentaiscool / end2end-asr-pytorch
End-to-End Automatic Speech Recognition on PyTorch
☆304 · Jun 2, 2022 · Updated 4 years ago
tencent-ailab / pika
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
☆354 · Dec 25, 2020 · Updated 5 years ago
placebokkk / pyfst
A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)
☆17 · Apr 2, 2018 · Updated 8 years ago
jfainberg / lattice_combination
Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices
☆16 · Mar 19, 2024 · Updated 2 years ago
rwth-i6 / returnn-experiments
experiments with RETURNN
☆162 · Jun 18, 2026 · Updated last month
craffel / mocha
Example implementation of Monotonic Chunkwise Attention.
☆54 · Feb 23, 2018 · Updated 8 years ago
wangkenpu / WSJ2WAV
Convert WSJ sphere format to waveform and do data simulation.
☆16 · Feb 20, 2020 · Updated 6 years ago
kate-egorova / ASR-hybrid-decoding
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11 · Feb 4, 2020 · Updated 6 years ago
Sundy1219 / eesen-for-thchs30
ASR for Chinese Mandarin
☆76 · Jun 1, 2018 · Updated 8 years ago
tencent-ailab / 3m-asr
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
☆119 · Jun 22, 2022 · Updated 4 years ago
Alexander-H-Liu / NPC
Non-Autoregressive Predictive Coding
☆51 · Nov 3, 2020 · Updated 5 years ago