awasthiabhijeet / Error-Driven-ASR-PersonalizationView external linksLinks
Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021
☆11Jun 13, 2021Updated 4 years ago
Alternatives and similar repositories for Error-Driven-ASR-Personalization
Users that are interested in Error-Driven-ASR-Personalization are comparing it to the libraries listed below
Sorting:
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 6 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Spiking neural networks (SNNs) for speech classification☆12Mar 14, 2022Updated 3 years ago
- ☆17Apr 28, 2021Updated 4 years ago
- Deepspeech ASR Model for the Catalan Language☆17Feb 15, 2021Updated 4 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- ☆14Feb 9, 2023Updated 3 years ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆17Jan 15, 2026Updated 3 weeks ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Mar 6, 2023Updated 2 years ago
- Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?☆19Oct 5, 2022Updated 3 years ago
- ☆17Mar 1, 2024Updated last year
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Sep 30, 2024Updated last year
- ASR text preprocessing utility☆21Aug 5, 2024Updated last year
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Dec 31, 2021Updated 4 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Aug 16, 2021Updated 4 years ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…☆28Nov 23, 2023Updated 2 years ago
- Official source for Catalan Language Models and resources made within Aina project.☆26Jul 28, 2023Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Dec 21, 2023Updated 2 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆26Jul 16, 2021Updated 4 years ago
- Automated, end-to-end wakeword model maker using the Precise Wakeword Engine☆27Feb 23, 2022Updated 3 years ago
- LogicCircuit is a program that helps build/simulate simple circuits using logic gates. It is meant to teach people the basics of how logi…☆10Jan 22, 2025Updated last year
- Support tools for punctuation and boundary detection for ASR output.☆55Dec 8, 2022Updated 3 years ago
- An echo cancellation library for browsers using DTLN-aec☆26Oct 18, 2023Updated 2 years ago
- BBB plugin for automatic subtitles in conference calls☆29Apr 14, 2022Updated 3 years ago
- Dynamic time warping (DTW) functions for specifically speech alignment.☆30May 6, 2024Updated last year
- ☆29Jan 15, 2022Updated 4 years ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Jul 5, 2019Updated 6 years ago
- Source code for the Apple reproduction☆33Apr 23, 2021Updated 4 years ago
- ☆76Oct 25, 2021Updated 4 years ago
- Comprehensive quantitative comparison of lossless and lossy audio codecs☆39Feb 11, 2023Updated 3 years ago