MLSpeech / speech_yolo
SpeechYOLO Interspeech 2019
☆42Updated 2 years ago
Related projects: ⓘ
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆29Updated 11 months ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated 11 months ago
- A packaged convolutional voice activity detector for noisy environments.☆14Updated 5 years ago
- ☆12Updated this week
- The codebase for Data-driven general-purpose voice activity detection.☆93Updated last year
- ☆40Updated 3 weeks ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆41Updated last year
- ☆51Updated this week
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆31Updated 5 years ago
- Implementaion RNN tranceducer☆20Updated 5 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 3 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 2 years ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆57Updated 3 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆19Updated 3 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆26Updated 9 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆25Updated 3 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 4 years ago
- Speaker recognition ,Voiceprint recognition☆51Updated 4 years ago
- ☆22Updated 2 years ago
- End-to-end diarization loss☆19Updated 3 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 5 years ago
- End-to-End Speech Recognition Using Tensorflow☆40Updated last year
- 2nd place solution for ID R&D Voice Antispoofing Challenge☆15Updated 5 years ago
- ☆74Updated 2 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- Keras implementations of Tacotron-2☆27Updated 3 years ago