idiap/w2v2-air-traffic

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/idiap/w2v2-air-traffic)

idiap / w2v2-air-traffic

This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)

☆42

Alternatives and similar repositories for w2v2-air-traffic

Users that are interested in w2v2-air-traffic are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

idiap / bert-text-diarization-atc
View on GitHub
This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)
☆17Dec 1, 2022Updated 3 years ago
idiap / atco2-corpus
View on GitHub
A Corpus for Research on Robust Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications
☆89Mar 24, 2023Updated 3 years ago
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
idiap / zff_vad
View on GitHub
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆23Oct 19, 2023Updated 2 years ago
jlvdoorn / WhisperATC
View on GitHub
Applying Large-Scale Weakly-Supervised Automatic Speech Recognition to Air Traffic Control
☆45Nov 29, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
CODEJIN / VITS_Diffusion
View on GitHub
☆26Sep 22, 2022Updated 3 years ago
Yifei-ZHAO96 / STAM-pytorch
View on GitHub
Pytorch implementation of "spectro-temporal attention-based voice activity detection"
☆13Jun 4, 2024Updated 2 years ago
sarulab-speech / whisper-asr-finetune
View on GitHub
☆32Dec 4, 2022Updated 3 years ago
vtuber-plan / hifi-gan
View on GitHub
An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.
☆32Apr 10, 2023Updated 3 years ago
bayartsogt-ya / whisper-multiple-hf-datasets
View on GitHub
Whisper fine-tuning event script to use multiple hf datasets
☆32Dec 20, 2022Updated 3 years ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
asteroid-team / Libri_VAD
View on GitHub
Script to generate VAD dataset used in Asteroid recipe
☆21Sep 30, 2021Updated 4 years ago
hcy71o / AutoVocoder
View on GitHub
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆71Dec 2, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
hecko-yes / tts-dataset-prompts
View on GitHub
Finally, some decent sample sentences
☆24Dec 3, 2023Updated 2 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
seungheondoh / hi_kia
View on GitHub
wake-up word emotion recognition [APSIPA 2022]
☆17Nov 11, 2022Updated 3 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
mzboito / IWSLT2022_Tamasheq_data
View on GitHub
Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…
☆18Nov 30, 2022Updated 3 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
pengzhendong / welm
View on GitHub
One command to build TLG.fst for WeNet.
☆30Oct 11, 2022Updated 3 years ago
LeonWlw / asr_blockformer
View on GitHub
E2E ASR system
☆14Oct 20, 2022Updated 3 years ago
doerlbh / MiniVox
View on GitHub
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
☆29Sep 20, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 5 years ago
alumae / voxlingua107_sb
View on GitHub
VoxLingua107 recipe for SpeechBrain
☆13Jul 3, 2021Updated 5 years ago
richardbaihe / a3t
View on GitHub
Code for paper A3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing
☆89Sep 6, 2024Updated last year
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
joann8512 / SusPedal-Gen
View on GitHub
This is the repository for Learning to Generate Piano Music With Sustain Pedals
☆12Nov 23, 2023Updated 2 years ago
JuanPZuluaga / accent-recog-slt2022
View on GitHub
Repository for Accent Recognition (Hackathon @SLT2022)
☆43May 12, 2024Updated 2 years ago
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 5 months ago
atosystem / MusicChain
View on GitHub
🎹🎵🎶 A platform to make Original and Cover Visible and Valuable.
☆14Nov 8, 2022Updated 3 years ago
sooftware / RNN-Transducer
View on GitHub
PyTorch implementation of RNN-Transducer(RNN-T).
☆81May 6, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
spring-media / DeepForcedAligner
View on GitHub
☆81Aug 8, 2025Updated 11 months ago
jasonppy / word-discovery
View on GitHub
Word Discovery in Visually Grounded, Self-Supervised Speech Models
☆27Dec 4, 2023Updated 2 years ago
RemiRigal / snreval-python
View on GitHub
This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…
☆12Jun 22, 2022Updated 4 years ago
othmar52 / midi2video
View on GitHub
convert midi file to piano video with highlighted keys
☆11Mar 26, 2021Updated 5 years ago
MTG / PodcastMix-inference
View on GitHub
☆32Jan 6, 2022Updated 4 years ago
jumon / whisper-punctuator
View on GitHub
Zero-shot multimodal punctuation insertion and truecasing using Whisper
☆120Feb 4, 2023Updated 3 years ago
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago