biyoml / PyTorch-End-to-End-ASR-on-TIMIT
Attention-based end-to-end ASR on TIMIT in PyTorch
☆17 Updated 3 years ago
Related projects
Alternatives and complementary repositories for PyTorch-End-to-End-ASR-on-TIMIT
- Improving Recording Device Generalization using Impulse Response Augmentation ☆10 Updated last year
- PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training ☆38 Updated 3 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT ☆69 Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20… ☆46 Updated 3 years ago
- Non-Autoregressive Predictive Coding ☆50 Updated 4 years ago
- A collection of papers related to speech model compression ☆24 Updated last year
- ☆18 Updated 2 years ago
- A PyTorch implementation of the paper: Acoustic Scene Classification with Multiple Decision Schemes. ☆20 Updated 3 years ago
- Official PyTorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023) ☆39 Updated last year
- Continual Learning Benchmark for Spoken Keyword Spotting ☆16 Updated 2 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆13 Updated 2 years ago
- A two-step optimization for sound source separation on the adaptive front-end domain ☆67 Updated 4 years ago
- ☆27 Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021 ☆57 Updated 2 years ago
- A temporal module for PyTorch-ComplexTensor ☆45 Updated 4 months ago
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement ☆73 Updated 3 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation ☆23 Updated 2 years ago
- ☆53 Updated 4 years ago
- Code and datasets for our ICASSP 2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und… ☆43 Updated last year
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE) ☆32 Updated last year
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021. ☆93 Updated last year
- ☆20 Updated 3 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement ☆45 Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a… ☆40 Updated 3 years ago
- The official repository for Audio ALBERT ☆64 Updated 2 years ago
- Asteroid's filterbanks ☆80 Updated 4 months ago
- A PyTorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification ☆31 Updated 3 years ago
- Jupyter notebook for DCASE 2020 challenge Task 1 ☆19 Updated 4 years ago
- A PyTorch 1.0 implementation of the convolutions described in SincNet ☆32 Updated 5 years ago
- ☆19 Updated 5 years ago