biyoml / PyTorch-End-to-End-ASR-on-TIMIT
Attention-based end-to-end ASR on TIMIT in PyTorch
☆17Updated 2 years ago
Related projects:
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆68Updated last year
- Non-Autoregressive Predictive Coding☆50Updated 3 years ago
- PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training☆38Updated 3 years ago
- Example code for a neural transducer model.☆58Updated 7 months ago
- FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)☆17Updated 10 months ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆10Updated last year
- Code and datasets for our ICASSP 2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…☆40Updated last year
- Vector Quantized Autoregressive Predictive Coding (VQ-APC)☆35Updated 3 years ago
- Script to perform statistical significance test between ASR hypotheses.☆19Updated 7 years ago
- A PyTorch 1.0 implementation of the convolutions described in SincNet☆32Updated 5 years ago
- A collection of papers related to speech model compression☆24Updated last year
- (Hybrid) BYOL-S feature extractor using the serab-byols package in PyTorch.☆26Updated 5 months ago
- The official repository for Audio ALBERT☆64Updated 2 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆35Updated 6 months ago
- PyTorch implementation of a multi-head attention RNN for keyword spotting☆20Updated 3 years ago
- PyTorch reimplementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆14Updated 3 years ago
- Spectra extraction tutorials based on torch and torchaudio.☆40Updated last year
- Repository for the paper "Towards duration robust weakly supervised sound event detection"☆23Updated last year
- PyTorch implementation of Generalized End-to-End Loss for speaker verification☆80Updated 5 years ago
- Making ESPnet easier to use☆51Updated 3 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆46Updated 3 years ago
- Few-Shot Keyword Spotting☆53Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated last year
- Instructions on downloading and using the LibriAdapt dataset☆44Updated 3 years ago
- A probabilistic scoring backend for length-normalized embeddings.☆10Updated 4 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.☆13Updated last year