jreremy / conformerView external linksLinks
Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.
☆28May 1, 2024Updated last year
Alternatives and similar repositories for conformer
Users that are interested in conformer are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition☆18Apr 25, 2021Updated 4 years ago
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"☆19Jun 21, 2023Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated 11 months ago
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 7 years ago
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆14Sep 25, 2023Updated 2 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.☆16Nov 5, 2020Updated 5 years ago
- a optional way to extract audio feature☆13Jun 10, 2017Updated 8 years ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Apr 11, 2022Updated 3 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Jun 25, 2021Updated 4 years ago
- Lightweight PDF Q&A tool powered by RAG (Retrieval-Augmented Generation) with MCP (Model Context Protocol) Support.☆21Oct 27, 2025Updated 3 months ago
- PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin…☆20Apr 3, 2024Updated last year
- Official implementation of Transpotter, published in BMVC 2021☆16Aug 6, 2022Updated 3 years ago
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆22Apr 27, 2024Updated last year
- A PyTorch Implementation of "Attention Is All You Need"☆38Oct 3, 2021Updated 4 years ago
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆23Aug 20, 2023Updated 2 years ago
- An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper☆21Aug 16, 2022Updated 3 years ago
- [ICASSPW] A Vector Quantized Masked AutoEncoder for speech emotion recognition☆29Mar 4, 2024Updated last year
- 양재 AI 실무자 교육 6조 프로젝트☆21Sep 18, 2018Updated 7 years ago
- Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - un…☆64Apr 2, 2020Updated 5 years ago
- ☆33Jan 14, 2023Updated 3 years ago
- ☆39Nov 22, 2024Updated last year
- Addressing the problem of predicting crime occurrence based on historic records☆11Nov 27, 2019Updated 6 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- AlignNet: A Unifying Approach to Audio-Visual Alignment (WACV 2020)☆34Jan 10, 2021Updated 5 years ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- ☆13Jun 18, 2025Updated 7 months ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an…☆35Jun 20, 2023Updated 2 years ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆149Nov 22, 2022Updated 3 years ago
- Implementation of the convolutional module from the Conformer paper, for use in Transformers☆433May 17, 2023Updated 2 years ago
- Mitigating the Filter Bubble while Maintaining Relevance: Targeted Diversification with VAE-based Recommender Systems☆10Mar 15, 2023Updated 2 years ago
- The official PyTorch implementation of "An Attentional Multi-scale Co-evolving Model for Dynamic Link Prediction" (TheWebConf'23)☆11May 4, 2023Updated 2 years ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆45Nov 5, 2025Updated 3 months ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Mar 4, 2021Updated 4 years ago
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field.☆12Sep 23, 2025Updated 4 months ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year