idiap / atco2-corpus
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
☆61 Updated 2 years ago
Alternatives and similar repositories for atco2-corpus
Users who are interested in atco2-corpus are comparing it to the repositories listed below.
- ☆37 Updated 10 months ago
- ☆15 Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆83 Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆50 Updated 10 months ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce… ☆22 Updated last month
- Implementation of Google's USM speech model in Pytorch ☆31 Updated last month
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO ☆63 Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆48 Updated last year
- ☆78 Updated last year
- Transcribing Speech with Multinomial Diffusion, training code and models. ☆76 Updated last year
- A TTS model that makes a speaker speak new languages ☆76 Updated 10 months ago
- Prosodic Speech Segmentation with Transformers ☆25 Updated last year
- ☆73 Updated 3 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆67 Updated 3 years ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆87 Updated 4 months ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin… ☆52 Updated last year
- An unofficial PyTorch implementation of VALL-E ☆87 Updated last week
- Applying Large-Scale Weakly-Supervised Automatic Speech Recognition to Air Traffic Control ☆30 Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 Updated 4 years ago
- ☆50 Updated last month
- Clustering-based methods for overlapping diarization ☆81 Updated last year
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project. ☆68 Updated 2 years ago
- Various speech datasets made available to the public ☆117 Updated 4 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆100 Updated 3 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability ☆101 Updated 3 months ago
- ☆63 Updated 3 weeks ago
- A simple package for Guided source separation (GSS) ☆121 Updated 11 months ago
- ☆38 Updated 7 months ago
- ConMamba for Automatic Speech Recognition ☆72 Updated 8 months ago