ruanvdmerwe / triplet-entropy-lossLinks
Project repository for the work done in Triplet Entropy Loss: Improving The Generalization of Short Speech Language Identification Systems
☆13Updated 4 years ago
Alternatives and similar repositories for triplet-entropy-loss
Users that are interested in triplet-entropy-loss are comparing it to the libraries listed below
Sorting:
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Updated 11 months ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated 2 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Light-weight transfer learning framework for on-device speech and audio recognition using pre-trained image convolutional neural networks…☆18Updated 3 years ago
- A handy dataset of noises for ASR☆22Updated 6 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆11Updated 6 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- Baseline kaldi script for UA-SPEECH corpus☆31Updated 11 months ago
- Sisyphus recipies for ASR☆17Updated last week
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Updated 3 years ago
- ☆17Updated 5 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated last year
- ☆11Updated 3 years ago
- Speechflow for emotion recognition related information decomposition☆10Updated 4 years ago
- ☆24Updated 6 years ago
- ☆17Updated 2 years ago
- ☆16Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆31Updated 3 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆14Updated 9 months ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Updated 2 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Updated 4 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Updated 4 years ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…☆10Updated 5 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Updated 5 years ago
- Mel spectrum based on tacotron2 for melgan speech synthesis☆15Updated 2 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆17Updated 2 years ago