pika-online / AESRC2020
a deep accent recognition network
☆48Updated 3 years ago
Alternatives and similar repositories for AESRC2020:
Users that are interested in AESRC2020 are comparing it to the libraries listed below
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆17Updated 3 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆58Updated 3 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆47Updated last month
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Updated 5 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆22Updated 3 years ago
- The official repository for Audio ALBERT☆64Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus☆29Updated 4 months ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 4 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 5 years ago
- ☆53Updated 4 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- ☆25Updated 3 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- ☆29Updated 2 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Updated 5 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Updated 5 years ago
- ☆36Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- The python implementation for paper "Towards Discriminative Representation Learning for Speech Emotion Recognition" in IJCAI-2019☆23Updated 5 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆29Updated 5 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- This is a implementation of kaldi-plda.☆15Updated 6 years ago