kamperh / globalphone_awe
Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.
☆11 · Nov 3, 2020 · Updated 5 years ago
Alternatives and similar repositories for globalphone_awe
Users who are interested in globalphone_awe are comparing it to the libraries listed below:
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021 ☆11 · Jun 13, 2021 · Updated 4 years ago
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24 (Oral) ☆12 · Nov 14, 2024 · Updated last year
- ☆11 · Feb 17, 2017 · Updated 9 years ago
- ☆13 · Jan 14, 2025 · Updated last year
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm. ☆11 · Jan 11, 2020 · Updated 6 years ago
- steps to perform text-based speaker diarization with kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- Java Bindings for the C++ library DeepSpeech ☆10 · Jun 4, 2020 · Updated 5 years ago
- Pure C# port of the Pocketsphinx keyword spotter ☆13 · Jan 19, 2020 · Updated 6 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994) ☆13 · Oct 8, 2020 · Updated 5 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15 · Jun 19, 2023 · Updated 2 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF. ☆24 · Aug 1, 2025 · Updated 6 months ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit ☆16 · Feb 20, 2016 · Updated 9 years ago
- Feature extraction for accented-speech or pathological speech ☆17 · Apr 2, 2019 · Updated 6 years ago
- Phonetically-Oriented Word Error Rate ☆36 · May 4, 2019 · Updated 6 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System ☆15 · Mar 31, 2019 · Updated 6 years ago
- ☆17 · Jul 22, 2024 · Updated last year
- ☆15 · Jul 4, 2024 · Updated last year
- ☆15 · May 8, 2021 · Updated 4 years ago
- Framework for one-shot multispeaker system based on Deep Learning ☆19 · May 30, 2021 · Updated 4 years ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS ☆24 · Sep 9, 2024 · Updated last year
- ☆21 · Jan 13, 2020 · Updated 6 years ago
- Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model ☆17 · Nov 24, 2016 · Updated 9 years ago
- ☆16 · Jun 13, 2022 · Updated 3 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS" ☆23 · Mar 6, 2023 · Updated 2 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi. ☆18 · Dec 21, 2021 · Updated 4 years ago
- A TensorFlow Implementation of Punctuation Restoration. ☆18 · Nov 9, 2020 · Updated 5 years ago
- BurrMill core ☆22 · Nov 2, 2021 · Updated 4 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition" ☆17 · Nov 28, 2021 · Updated 4 years ago
- A collection of all our phonemeizers for dataset construction and inference ☆27 · Feb 21, 2025 · Updated 11 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆33 · Oct 23, 2025 · Updated 3 months ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN) ☆21 · Sep 27, 2017 · Updated 8 years ago
- ☆22 · Jun 24, 2024 · Updated last year
- Analytic signal-based source information analysis for YANGstraight and real-time interactive tools ☆34 · Aug 20, 2019 · Updated 6 years ago
- ☆26 · Dec 4, 2024 · Updated last year
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean ☆23 · Dec 17, 2017 · Updated 8 years ago
- ☆21 · Sep 24, 2018 · Updated 7 years ago
- neural network based speaker embedder ☆25 · Jan 7, 2023 · Updated 3 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres… ☆23 · Mar 18, 2024 · Updated last year
- Convert words to numbers ☆21 · Apr 13, 2022 · Updated 3 years ago