PoKoHA/ASR-Conformer

Conformer: Convolution-augmented Transformer for Speech Recognition

☆15

Alternatives and similar repositories for ASR-Conformer

Users who are interested in ASR-Conformer are comparing it to the libraries listed below.

Kirili4ik / QuartzNet-ASR-pytorch
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. Implemented in PyTorch with CTC loss and beam search.
☆16 · Nov 5, 2020 · Updated 5 years ago
IS2AI / MultilingualASR
☆14 · Aug 9, 2021 · Updated 4 years ago
IS2AI / Uzbek_ASR
☆12 · Aug 9, 2021 · Updated 4 years ago
stdKonjac / DeepComplexCRN
☆13 · Mar 22, 2021 · Updated 5 years ago
shashankshirol / GeneratingNoisySpeechData
A repository of code for generating noisy speech data from clean data using deep learning methods
☆16 · Jul 12, 2021 · Updated 5 years ago
IS2AI / Kazakh_ASR
☆16 · Aug 1, 2025 · Updated 11 months ago
dobby-seo / korean-speech-recognition-quartznet
Korean speech recognition with Quartznet, a quantized model based on Jasper
☆22 · Jul 21, 2021 · Updated 5 years ago
swami1995 / V2V
Code for "From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers"
☆21 · Jun 12, 2024 · Updated 2 years ago
oleges1 / quartznet-pytorch
Quartznet implementation in PyTorch [https://arxiv.org/abs/1910.10261]
☆27 · Jul 16, 2021 · Updated 5 years ago
AlexK-PL / GST_Tacotron2
An adaptation of NVIDIA's PyTorch Tacotron2 with unsupervised Global Style Tokens. The model has been trained with the English read-speech LJ…
☆10 · Sep 4, 2023 · Updated 2 years ago
mariegold / NP-Attack
☆10 · Mar 22, 2022 · Updated 4 years ago
NikolaiKyhne / MambAttention
Official repository for the paper "MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement" (A…
☆35 · Mar 25, 2026 · Updated 3 months ago
cyjie429 / RegO
Region-Based Optimization in Continual Learning for Audio Deepfake Detection
☆14 · Dec 17, 2024 · Updated last year
PoKoHA / Speech_Enhancement-DCCRN
DCCRN: Deep Complex Convolution Recurrent Network
☆14 · Nov 26, 2021 · Updated 4 years ago
tuanio / conformer-rnnt
Conformer RNN-Transducer
☆14 · May 25, 2022 · Updated 4 years ago
Lhx94As / E2E-language-diarization
Source code for the paper "End-to-End Language Diarization for Bilingual Code-switching Speech"
☆19 · Jan 23, 2022 · Updated 4 years ago
fyxnl / COA
Code for the CVPR 2025 paper "CoA: Towards Real Image Dehazing via Compression-and-Adaptation"
☆15 · Apr 9, 2025 · Updated last year
bootphon / sustained-phonation-features
Python package for the extraction of speech features for sustained phonation
☆12 · Aug 10, 2020 · Updated 5 years ago
Li-Sanze / ID-Card
Given the front and back of an ID card, recognizes all text information on the card
☆10 · Sep 4, 2019 · Updated 6 years ago
robflynnyh / long-context-asr
Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆11 · Jul 3, 2026 · Updated 2 weeks ago
petrichorwq / DECRO-dataset
The deepfake cross-lingual evaluation dataset (DECRO) is constructed to evaluate the influence of language differences on deepfake detection.…
☆16 · Sep 14, 2023 · Updated 2 years ago
CLAD23 / CLAD
☆21 · Apr 23, 2024 · Updated 2 years ago
lab260ru / AASIST3
☆16 · May 3, 2026 · Updated 2 months ago
yzyouzhang / Empirical-Channel-CM
Official implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure …
☆19 · Feb 15, 2022 · Updated 4 years ago
Xinghui-Wu / KENKU
KENKU: Towards Efficient and Stealthy Black-box Adversarial Attacks against ASR Systems
☆19 · Oct 3, 2023 · Updated 2 years ago
NikolaiKyhne / xLSTM-SENet
Official repository for the paper "xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement" (Accepted to INTERSPEECH 2025)
☆60 · Aug 28, 2025 · Updated 10 months ago
bene-ges / nemo_compatible
Useful things that work with the NVIDIA NeMo library
☆14 · Jan 20, 2024 · Updated 2 years ago
minyoungpark1 / Speech-Enhancement
Unofficial implementation of SCP-GAN
☆18 · Jul 4, 2023 · Updated 3 years ago
upskyy / ContextNet
PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…
☆38 · Feb 27, 2022 · Updated 4 years ago
aliyun / alibabacloud-viapi-demo
alibabacloud-viapi-demo
☆18 · Jun 17, 2022 · Updated 4 years ago
SSahuDS / Lipreading-Using-Mutimodal-Speech-Recognition
Multimodal speech recognition for phoneme-level prediction using audio-visual data from the TCDTIMIT dataset, implementing RNNs with LSTMs for…
☆15 · Jul 27, 2023 · Updated 2 years ago
jaketae / conformer
PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition
☆18 · Apr 25, 2021 · Updated 5 years ago
WisleyWang / DC-AI-LipReading
☆11 · May 31, 2020 · Updated 6 years ago
EllaBot / true-online-td-lambda
Implementation of True Online TD(lambda) with a Fourier Basis function approximator.
☆13 · May 9, 2015 · Updated 11 years ago
emonosuke / emoASR
End-to-end MOdeling of ASR (Automatic Speech Recognition)
☆33 · Feb 16, 2023 · Updated 3 years ago
luanshiyinyang / ChineseOCR
End-to-end Chinese scene text recognition.
☆12 · Jun 27, 2022 · Updated 4 years ago
YudiDong / GAN-based-E2E-communications-system-for-defense-against-adversarial-attack
A Robust Adversarial Network-Based End-to-End Communications System With Strong Generalization Ability Against Adversarial Attacks
☆19 · Jan 21, 2022 · Updated 4 years ago
Curt-Park / triton-inference-server-practice
Archives for Triton Inference Server Practices
☆15 · Feb 28, 2022 · Updated 4 years ago
LeeYongHyeok / DCM_vgg_transformer
Dual cross-modality attention audio-visual speech recognition model based on a VGG transformer with a hybrid CTC/attention architecture using…
☆14 · Jul 2, 2020 · Updated 6 years ago