idiap / atco2-corpus
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
☆56 · Updated last year
Alternatives and similar repositories for atco2-corpus:
Users who are interested in atco2-corpus are comparing it to the repositories listed below:
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆80 · Updated last year
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO ☆61 · Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆47 · Updated 8 months ago
- Phoneme segmentation using pre-trained speech models ☆55 · Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models. ☆76 · Updated last year
- A simple package for Guided source separation (GSS) ☆116 · Updated 9 months ago
- SelfRemaster: SSL Speech Restoration ☆88 · Updated last year
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization. ☆49 · Updated 9 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆58 · Updated 2 weeks ago
- Clustering-based methods for overlapping diarization ☆76 · Updated last year
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation ☆143 · Updated last year
- Repository for Accent Recognition (Hackathon @SLT2022) ☆25 · Updated 9 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆82 · Updated last month
- NOTSOFAR-1 Challenge: Distant Diarization and ASR ☆50 · Updated 2 weeks ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆67 · Updated 3 years ago
- Predicts the level of noise and reverberation in your audio files ☆144 · Updated 9 months ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec" ☆133 · Updated 5 months ago
- This is the M-AILABS Speech Dataset ☆42 · Updated 3 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 · Updated last year
- This repository surveys papers focusing on prompting and adapters for speech processing. ☆107 · Updated last year
- Streaming Audiotransformers for online Audio tagging ☆43 · Updated 8 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆19 · Updated 4 months ago