idiap / atco2-corpusLinks
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
☆67Updated 2 years ago
Alternatives and similar repositories for atco2-corpus
Users that are interested in atco2-corpus are comparing it to the libraries listed below
Sorting:
- ☆40Updated last year
- ☆16Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated 3 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆78Updated last year
- A TTS model that makes a speaker speak new languages☆76Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆103Updated 10 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆94Updated 7 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆88Updated last year
- ☆34Updated last year
- SelfRemaster: SSL Speech Restoration☆89Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆32Updated 2 years ago
- asr2k☆52Updated last year
- ☆43Updated 11 months ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated 3 months ago
- PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.☆59Updated 3 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆176Updated last year
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆51Updated last year
- A sequence-to-sequence voice conversion toolkit.☆102Updated last year
- Online streaming speaker change detection model in Pytorch☆42Updated 2 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆159Updated 3 weeks ago
- ☆64Updated last year
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆43Updated 5 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated 6 months ago
- ☆19Updated last year
- Collection of scripts from mHuBERT-147.☆29Updated 9 months ago
- This is the M-AILABS Speech Dataset☆78Updated 9 months ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆17Updated 2 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Updated 3 months ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆59Updated last year