sciforce / phones-las
Articulatory features estimation using Listen Attend and Spell architecture.
☆33Apr 24, 2020Updated 5 years ago
Alternatives and similar repositories for phones-las
Users who are interested in phones-las are comparing it to the libraries listed below.
- Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project utilizes input pipeline and estimator API …☆89Jan 31, 2019Updated 7 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- ☆22Jun 30, 2021Updated 4 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- ☆22Mar 22, 2017Updated 8 years ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 2 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- ☆15Oct 29, 2024Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Companion toolkit of the 'Serial Speakers' dataset.☆11Feb 17, 2020Updated 6 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Recognizing common speech commands using Keras and Tensorflow.☆10Dec 17, 2018Updated 7 years ago
- Dr.VOT is a software package for automatic measurement of voice onset time (VOT).☆29Jul 25, 2023Updated 2 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆45Jun 29, 2021Updated 4 years ago
- Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs☆31Dec 15, 2014Updated 11 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Jan 28, 2026Updated 2 weeks ago
- ☆15Apr 20, 2018Updated 7 years ago
- Generates masks you can print out, cut and wear from images of faces.☆13Jun 23, 2017Updated 8 years ago
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- Python library for handling audio datasets.☆138Jul 6, 2023Updated 2 years ago
- Speaker diarization python system based on binary key speaker modelling☆60Jan 12, 2022Updated 4 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- Speech in Flax/JAX☆15Jul 11, 2022Updated 3 years ago
- ☆20Nov 3, 2021Updated 4 years ago
- A Python package for audio annotation and classifier training. Developed in collaboration with the WGBH Foundation and the American Archi…☆17Jun 2, 2018Updated 7 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Jan 2, 2020Updated 6 years ago
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 6 years ago
- Grapheme to phoneme model for PyTorch☆43Jul 21, 2022Updated 3 years ago
- Text-based media editing interface☆16Aug 9, 2017Updated 8 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- ☆23Jan 21, 2022Updated 4 years ago
- GPT-2 Metadata Pretraining Towards Instruction Finetuning for Ukrainian☆20Aug 6, 2023Updated 2 years ago
- Small-footprint Keyword Spotting☆18Jul 28, 2019Updated 6 years ago
- Clean Code concepts adapted for Julia☆18Mar 14, 2020Updated 5 years ago
- Wake word spotting with Kaldi☆19Dec 3, 2020Updated 5 years ago
- C++ Implementation of the Information Bottleneck System☆22Jan 9, 2019Updated 7 years ago
- A repository for dictionaries to be used with the Prosodylab-Aligner☆17May 13, 2014Updated 11 years ago