Popgun-Labs / SincNetConv
A PyTorch 1.0 implementation of the convolutions described in SincNet
☆32Updated 5 years ago
Related projects
Alternatives and complementary repositories for SincNetConv
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆38Updated 3 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- ☆55Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated 2 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆29Updated 4 years ago
- ☆36Updated 3 years ago
- experiments about AudioSet☆43Updated last year
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆51Updated 4 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆64Updated 5 years ago
- A PyTorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 3 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- ☆41Updated last year
- Develop speaker recognition model based on i-vector using TIMIT database☆16Updated 5 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆23Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated last month
- Discriminative Condition-Aware PLDA☆42Updated 3 months ago
- ☆36Updated 2 years ago
- WaveNet auto-encoders for ZeroSpeech challenge 2020☆36Updated 2 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement☆73Updated 3 years ago
- ☆17Updated 5 years ago
- Speech (audio) subjective evaluation system☆37Updated 4 years ago
- Download and create a tfreader for the audioset dataset☆16Updated 4 years ago
- ☆63Updated last year