bagustris / w2v2-vad
A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition
☆18Aug 9, 2023Updated 2 years ago
Alternatives and similar repositories for w2v2-vad
Users that are interested in w2v2-vad are comparing it to the libraries listed below
- ☆11Nov 5, 2021Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- ☆10Sep 19, 2022Updated 3 years ago
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- Repository for my paper: Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recog…☆20Mar 13, 2024Updated last year
- ☆13Oct 27, 2021Updated 4 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 5, 2025Updated last year
- Train a fiwGAN or ciwGAN model using your own training data☆14Oct 13, 2022Updated 3 years ago
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Oct 24, 2023Updated 2 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- ☆15Nov 5, 2021Updated 4 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- A neural language modeling toolkit built on PyTorch☆19Mar 17, 2023Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fast.☆16Jun 17, 2022Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- Vim Speech Recognition Experiments☆20May 30, 2025Updated 8 months ago
- Acoustic Neighbor Embeddings☆29Jul 13, 2025Updated 7 months ago
- Simple Telegram bot to annotate and verify automatic speech recognition datasets☆12Mar 30, 2021Updated 4 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- Multistream CNN for Robust Acoustic Modeling☆40Jun 17, 2021Updated 4 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- Docker for building an environment for Dutch online and offline ASR.☆12Feb 2, 2021Updated 5 years ago
- An implementation of RNN-Transducer loss in TF-2.0.☆46Jan 7, 2026Updated last month
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- ☆25Jun 14, 2022Updated 3 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- The case study and multilingual performance of ICASSP submission☆24Sep 24, 2022Updated 3 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Sep 20, 2021Updated 4 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Transfer learning approach to pronunciation scoring☆11Jan 17, 2024Updated 2 years ago
- Steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Oct 9, 2025Updated 4 months ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Nov 30, 2021Updated 4 years ago