xingchensong/Speech-Transformer-plus-2DAttention

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xingchensong/Speech-Transformer-plus-2DAttention)

xingchensong / Speech-Transformer-plus-2DAttention

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

☆12

Alternatives and similar repositories for Speech-Transformer-plus-2DAttention

Users that are interested in Speech-Transformer-plus-2DAttention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nwpuaslp / ASC_baseline
View on GitHub
☆20Nov 22, 2020Updated 5 years ago
foamliu / Speech-Transformer
View on GitHub
PyTorch re-implementation of Speech-Transformer
☆102Nov 19, 2021Updated 4 years ago
myst-templates / arxiv_two_column
View on GitHub
A two-column template for pre-prints based on the arXiv submission guide.
☆17Sep 10, 2025Updated 10 months ago
yuguochencuc / CinCGAN-SE
View on GitHub
Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement
☆10Jan 24, 2022Updated 4 years ago
taiqing / pinyin2hanzi
View on GitHub
End-to-end translation of Chinese phonetics to characters using bi-directional RNN (LSTM/GRU)
☆29Aug 2, 2019Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
andrewyurick / gunshot_recognition
View on GitHub
Gun Model Recognition From Gunshot Audio Project
☆16Jan 11, 2021Updated 5 years ago
chaufanglin / Normal2Whisper
View on GitHub
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14Oct 31, 2024Updated last year
ZhihaoHu / PyTorchDataCompression
View on GitHub
☆16Nov 29, 2020Updated 5 years ago
jingyonghou / KWS_Max-pooling_RHE
View on GitHub
Mining effective negative training samples for keyword spotting (PyTorch)
☆66May 23, 2020Updated 6 years ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
huangruizhe / audio
View on GitHub
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆10Sep 30, 2024Updated last year
IMLHF / WFb_SE
View on GitHub
(tensorflow) Wiener Filter based Speech Enhancement（LSTM/BLSTM, GRU/BGRU, Transformer）
☆15Dec 3, 2019Updated 6 years ago
fchest / Speech-Transformer-multi-GPUs
View on GitHub
A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…
☆10Dec 25, 2019Updated 6 years ago
kaituoxu / Speech-Transformer
View on GitHub
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810Apr 6, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pengzhendong / streaming-asr
View on GitHub
One command to start a streaming ASR server.
☆12Oct 2, 2024Updated last year
emiljoswin / Deep-Humor-Generation-Analysis-and-Classification-of-Humor-using-Transformers
View on GitHub
Analyse the self-attention patterns in BERT for humor classification and verify the linguistic theory of humor, use GPT-2 to create humor…
☆11Apr 30, 2020Updated 6 years ago
fy378968174 / GAN-based-speech-enhancement-Keras-
View on GitHub
Keras implementation of speech enhancement based on LSGAN
☆20Dec 10, 2017Updated 8 years ago
Marcovaldong / lstmp.pytorch
View on GitHub
The implementation of LSTM with projection layer by PyTorch
☆17Sep 1, 2019Updated 6 years ago
BYRTIMO / END-TO-END-SPEECH-ENHANCEMENT-BASED-ON-DISCRETE-COSINE-TRANSFORM
View on GitHub
☆18Nov 10, 2019Updated 6 years ago
k2-fsa / sherpa-mlx
View on GitHub
sherpa with mlx
☆15Aug 2, 2025Updated 11 months ago
jiay7 / wenet_onlinedecode
View on GitHub
Went online decode demo
☆31Apr 28, 2021Updated 5 years ago
jzshq208886 / wenet_asr
View on GitHub
☆12Jul 11, 2024Updated 2 years ago
qniksefat / lexitalk
View on GitHub
🤖🎙️ Explore Lex Fridman Podcast Transcripts with a smart chatbot!
☆10Mar 13, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
stephen-harmon-newman / Audio-Denoising
View on GitHub
A CNN for denoising speech.
☆16Jun 2, 2019Updated 7 years ago
JohnsonLee1999 / 2021TJUThesisLatexTemplate
View on GitHub
2021届天津大学最新毕设latex模板。
☆13May 25, 2021Updated 5 years ago
tennisonliu / noise_reduction
View on GitHub
Using Spectral Noise Gating (SNG) techniques to reduce background noise in streaming microphone input for enhanced vocal recognition
☆24Dec 10, 2018Updated 7 years ago
Bartelds / ctc-dro
View on GitHub
Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.
☆17May 16, 2025Updated last year
athena-team / athena-decoder
View on GitHub
☆76Mar 18, 2022Updated 4 years ago
foamliu / Listen-Attend-Spell-v2
View on GitHub
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
☆39Jul 25, 2019Updated 7 years ago
aparwal / DeepSeparation
View on GitHub
Keras Implementation and Experiments with Deep Recurrent Neural Networks for Source Separation
☆18May 4, 2018Updated 8 years ago
linan2 / TensorFlow-speech-enhancement
View on GitHub
DNN and RCED speech enhancement
☆20Jan 30, 2024Updated 2 years ago
Searcher408 / DNN-Speech-Enhancement-Task
View on GitHub
An Experimental Study on Speech Enhancement based on DNN.
☆14Aug 11, 2018Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jundaychan / funasr-fastapi
View on GitHub
funasr语音转文字的简单api版本，funasr+fastapi，方便部署在服务器上
☆13Aug 10, 2024Updated last year
JosephHuang913 / Turbo-Equalization
View on GitHub
The performance of turbo equalizers in both ISI channel and multipath fading channel is evaluated
☆12Nov 24, 2020Updated 5 years ago
Orkis-Research / Quaternion-Convolutional-Neural-Networks-for-End-to-End-Automatic-Speech-Recognition
View on GitHub
This is the code for the paper 'Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition'. It provides all th…
☆67Jan 24, 2019Updated 7 years ago
gulnazaki / meowify
View on GitHub
A web app and flask server to turn vocals from any youtube song to meows!
☆13Jan 8, 2021Updated 5 years ago
vngasp / RedeNeuralMegaSena
View on GitHub
Implementando uma rede neural com dados dos sorteios da Mega Sena
☆31Jun 5, 2018Updated 8 years ago
benhuryuval / reed-muller-codes-matlab
View on GitHub
A MATLAB function library containing encoders, decoders and weight enumerators for Reed-Muller codes.
☆13Aug 19, 2023Updated 2 years ago
asappresearch / multistream-cnn
View on GitHub
Multistream CNN for Robust Acoustic Modeling
☆40Jun 17, 2021Updated 5 years ago