idiap / w2v2-air-traffic
This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)
☆41Jul 10, 2024Updated last year
Alternatives and similar repositories for w2v2-air-traffic
Users who are interested in w2v2-air-traffic are comparing it to the repositories listed below.
- This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)☆16Dec 1, 2022Updated 3 years ago
- A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications☆82Mar 24, 2023Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Apr 10, 2023Updated 2 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Oct 19, 2023Updated 2 years ago
- Whisper fine-tuning event script to use multiple hf datasets☆32Dec 20, 2022Updated 3 years ago
- Applying Large-Scale Weakly-Supervised Automatic Speech Recognition to Air Traffic Control☆41Nov 29, 2023Updated 2 years ago
- ☆32Dec 4, 2022Updated 3 years ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Sep 20, 2021Updated 4 years ago
- TTS Android demo of PaddleSpeech, merged into https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/demos☆28Nov 30, 2022Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- PyTorch implementation of "spectro-temporal attention-based voice activity detection"☆13Jun 4, 2024Updated last year
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Mar 9, 2022Updated 3 years ago
- Evaluation of STT models for the German language☆15Jan 22, 2022Updated 4 years ago
- Enable RNNLM lattice rescoring with PyTorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- ☆32Jan 6, 2022Updated 4 years ago
- PyTorch implementation of RNN-Transducer(RNN-T).☆81May 6, 2021Updated 4 years ago
- Repository for Accent Recognition (Hackathon @SLT2022)☆38May 12, 2024Updated last year
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- Both audio-only and audio-visual speaker diarization datasets are listed here.☆14Feb 22, 2023Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- A repository for trainable multi-speaker TTS☆14Nov 28, 2021Updated 4 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 5, 2025Updated last year
- Train a fiwGAN or ciwGAN model using your own training data☆14Oct 13, 2022Updated 3 years ago
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).☆15Jun 30, 2023Updated 2 years ago