jlvdoorn / WhisperATCLinks
Applying Large-Scale Weakly-Supervised Automatic Speech Recognition to Air Traffic Control
☆34Updated last year
Alternatives and similar repositories for WhisperATC
Users that are interested in WhisperATC are comparing it to the libraries listed below
Sorting:
- A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications☆67Updated 2 years ago
- ☆38Updated last year
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆16Updated 9 months ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Updated 2 years ago
- Whisper finetuning☆14Updated 3 months ago
- ☆16Updated 2 years ago
- E2E ASR system☆14Updated 2 years ago
- ☆15Updated last year
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Updated 11 months ago
- Detecting and correction dysfluencies/stuttering/stammering in audio files☆10Updated 2 years ago
- ☆29Updated last year
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 5 months ago
- ☆50Updated 4 years ago
- ☆40Updated last year
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆22Updated 8 months ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 4 years ago
- ☆10Updated last year
- ☆19Updated last year
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Updated last year
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms.☆25Updated last year
- Benchmarking different VAD models on AVA-Speech dataset☆16Updated 2 years ago
- MeetEval - A meeting transcription evaluation toolkit☆104Updated last week
- Official repository for LMFCA-Net: A Lightweight Model for Multi-Channel Speech Enhancement with Efficient Narrow-Band and Cross-Band Att…☆19Updated 4 months ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆38Updated 2 months ago
- ☆11Updated 2 weeks ago
- Discriminative Training of VBx Diarization☆25Updated 10 months ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆15Updated last month
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆28Updated 7 months ago