An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper
☆20Aug 16, 2022Updated 3 years ago
Alternatives and similar repositories for Conformer
Users that are interested in Conformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆24Jul 21, 2024Updated last year
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆14Sep 25, 2023Updated 2 years ago
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper☆16Mar 4, 2022Updated 4 years ago
- SASV2 baseline, a track on ASVspoof5 phase2 challenge☆27Nov 12, 2025Updated 6 months ago
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)☆29Jul 9, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆13Jul 31, 2023Updated 2 years ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14May 15, 2022Updated 4 years ago
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Mar 4, 2022Updated 4 years ago
- [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)☆1,123Jan 5, 2026Updated 4 months ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"☆11Apr 6, 2020Updated 6 years ago
- Calculator Tool of Word Error Rate and Character Error Rate☆14Nov 3, 2020Updated 5 years ago
- ☆13May 14, 2021Updated 5 years ago
- Implementation of the convolutional module from the Conformer paper, for use in Transformers☆438May 17, 2023Updated 3 years ago
- A fine-tuned Large Language Model (LLM) for the Vietnamese language based on the Llama 2 model.☆18Sep 12, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆41Aug 29, 2024Updated last year
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆10Dec 15, 2022Updated 3 years ago
- Cross attentive pooling for speaker verification (IEEE SLT, 2021)☆12Dec 14, 2020Updated 5 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆77Aug 15, 2021Updated 4 years ago
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Jul 21, 2020Updated 5 years ago
- Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.☆29May 1, 2024Updated 2 years ago
- This is an official PyTorch code for our accepted paper "When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-w…☆15Jul 7, 2022Updated 3 years ago
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆99May 30, 2025Updated 11 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆114Feb 27, 2022Updated 4 years ago
- Adaptive Sparse ViT☆16Aug 1, 2023Updated 2 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2021☆18Jul 21, 2021Updated 4 years ago
- Torch implementation of ViT based classifier for Audio classification☆12May 22, 2022Updated 4 years ago
- 2022 DCASE Challenge☆14Sep 30, 2024Updated last year
- ☆16Mar 29, 2022Updated 4 years ago
- Python toolkit for speech processing☆72Updated this week
- Python implementation of time varying filter EMD☆14Mar 3, 2024Updated 2 years ago
- ☆15Oct 15, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official implementation of Hierarchical Spectrogram Transformers (HST)☆20Oct 10, 2022Updated 3 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…☆12Jan 24, 2024Updated 2 years ago
- Vietnamese diacritics restoration☆13Jan 18, 2016Updated 10 years ago
- Conformer encoder + Transformer decoder with Hybrid CTC/attention☆12Nov 11, 2021Updated 4 years ago
- ☆12Dec 30, 2020Updated 5 years ago
- [IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast☆21Oct 25, 2023Updated 2 years ago
- ☆14Dec 8, 2022Updated 3 years ago