georgian-io / Knowledge-Distillation-Toolkit
[DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.
☆137Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for Knowledge-Distillation-Toolkit
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆210Updated last year
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆71Updated 3 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆69Updated 2 years ago
- [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition☆244Updated last year
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆70Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆110Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- Example code for a neural transducer model.☆60Updated 9 months ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆130Updated last year
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆89Updated 5 months ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆39Updated 3 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆29Updated 4 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated last year
- Official Implementation of Mockingjay in Pytorch☆52Updated last year
- The official repository for Audio ALBERT☆64Updated 2 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆57Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- ☆74Updated 3 years ago
- Official code for Wav2Seq☆95Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆47Updated last year
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- Various speech datasets made available to the public☆99Updated last month
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆36Updated last year
- Non-Autoregressive Predictive Coding☆50Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Updated last year
- A python library for voice activity detection (VAD) for speech/non-speech segmentation.☆83Updated 2 years ago