Audio Diarization Annotation tool
☆30Nov 8, 2019Updated 6 years ago
Alternatives and similar repositories for audio_diarization_annotation
Users that are interested in audio_diarization_annotation are comparing it to the libraries listed below
Sorting:
- Transfer learning approach to pronunciation scoring☆11Jan 17, 2024Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 2 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Dec 12, 2019Updated 6 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- ☆33Nov 27, 2021Updated 4 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- This is application for dysarthria to improve their pronunciation by using deep learning☆10Dec 29, 2020Updated 5 years ago
- ☆11Jun 14, 2024Updated last year
- A C++ library for parsing and manipulating JSGF grammar files.☆14Feb 13, 2024Updated 2 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Testing sets for semanticVAD☆20Feb 18, 2025Updated last year
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- ☆41Jun 25, 2018Updated 7 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- ☆13Apr 14, 2024Updated last year
- ☆25Jun 14, 2022Updated 3 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Jul 6, 2023Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆13Feb 5, 2025Updated last year
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated 11 months ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Mar 9, 2022Updated 3 years ago
- A cross platform (Android/iOS/MacOS) Bahasa Indonesia speech recognizer library, written in Flutter.☆12Nov 18, 2025Updated 3 months ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Oct 9, 2025Updated 4 months ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Using OpenVINO to speed up MeloTTS inference☆15Nov 1, 2024Updated last year
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- Java Bindings for the C++ library DeepSpeech☆10Jun 4, 2020Updated 5 years ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated 10 months ago
- Went online decode demo☆31Apr 28, 2021Updated 4 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Dec 4, 2023Updated 2 years ago
- Speech Processing & Linguistic Analysis Tool☆11Jun 30, 2019Updated 6 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- Implementation of StyleTTS for Mandarin☆11Jun 22, 2023Updated 2 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆36Feb 5, 2026Updated 3 weeks ago
- An automatic speech recognition environment for Icelandic based on Kaldi☆14Oct 12, 2017Updated 8 years ago