CoEDL / vad-sli-asr
A pipeline to isolate and transcribe one language in mixed-language speech
☆18Updated 2 years ago
Alternatives and similar repositories for vad-sli-asr
Users who are interested in vad-sli-asr are comparing it to the libraries listed below
- Repository for Accent Recognition (Hackathon @SLT2022)☆32Updated last year
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆28Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- PyTorch based speaker embedding model☆16Updated last year
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Workflow for forced alignment between languages☆19Updated last year
- A handy dataset of noises for ASR☆21Updated 6 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Updated 3 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Updated 2 years ago
- Differentiable Mean Opinion Score Regularization for Perceptual Speech Enhancement☆23Updated 2 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆21Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆27Updated last year
- ☆10Updated 3 years ago
- Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model☆32Updated last year
- ☆25Updated 3 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated last year
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆42Updated 3 years ago
- ☆24Updated 2 months ago
- ☆13Updated 11 months ago
- ☆17Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- ☆14Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated last month
- ☆56Updated 2 years ago
- Deep Speech Distances PyTorch☆29Updated 3 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆16Updated 3 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆21Updated 3 weeks ago
- ☆37Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆54Updated 2 years ago