ondrejklejch / learning_to_adapt
Coordinate-wise meta-learner for speaker adaptation of ASR models.
☆20Updated 4 years ago
Related projects: ⓘ
- VoxSRC Challenge☆31Updated 5 years ago
- PyTorch implementation of a self-attentive speaker embedding☆16Updated 4 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆70Updated 4 years ago
- Region proposal network based small-footprint keyword spotting (Pytorch)☆51Updated 10 months ago
- ☆98Updated 6 years ago
- Old language modeling tool that's used in kaldi☆16Updated last year
- Pytorch implementation of "Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion" [Intersp…☆28Updated 5 years ago
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆31Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆35Updated 4 years ago
- ☆20Updated 5 years ago
- SE-Resnet+AMSoftmax for Speaker Verification☆47Updated 5 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆62Updated 5 years ago
- Recurrent Neural Aligner☆49Updated 4 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 2 years ago
- Gaussian Mixture VAE Tacotron☆52Updated last year
- PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)☆76Updated 6 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 5 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆52Updated last year
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Updated 4 years ago
- ☆55Updated 4 years ago
- Dataset and baseline for the first Audiocaption task☆78Updated last month
- E2E-SincNet: Toward fully end-to-end speech recognition☆29Updated 4 years ago
- ☆16Updated 5 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆37Updated 4 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆55Updated 4 years ago
- Example implementation of Monotonic Chunkwise Attention.☆49Updated 6 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 4 years ago