sciforce/phones-las

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sciforce/phones-las)

sciforce / phones-las

Articulatory features estimation using Listen Attend and Spell architecture.

☆33

Alternatives and similar repositories for phones-las

Users that are interested in phones-las are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WindQAQ / listen-attend-and-spell
View on GitHub
Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project utilizes input pipeline and estimator API …
☆90Jan 31, 2019Updated 7 years ago
lucko515 / Speech-commands-recognition
View on GitHub
Recognizing common speech commands using Keras and Tensorflow.
☆10Dec 17, 2018Updated 7 years ago
quatrix / clean-code-julia
View on GitHub
Clean Code concepts adapted for Julia
☆18Mar 14, 2020Updated 6 years ago
TideDancer / iclr22-wctc
View on GitHub
☆15Mar 15, 2022Updated 4 years ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
View on GitHub
☆12Jun 10, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
KrishnaDN / BERTphone
View on GitHub
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Dec 10, 2020Updated 5 years ago
KrishnaDN / Keyword-Transformer
View on GitHub
Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"
☆23May 19, 2021Updated 5 years ago
jcvasquezc / phonet
View on GitHub
Keras-based python framework to compute phonological posterior probabilities from audio files
☆48Dec 27, 2022Updated 3 years ago
monikaUPF / PyToBI
View on GitHub
A Toolkit for ToBI Labeling with Python Data Structures
☆25May 16, 2022Updated 4 years ago
kamperh / recipe_swbd_wordembeds
View on GitHub
☆22Mar 22, 2017Updated 9 years ago
archiki / ASR-Accent-Analysis
View on GitHub
Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.
☆15Jun 27, 2020Updated 6 years ago
desh2608 / pytorch-tdnn
View on GitHub
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆41Dec 18, 2020Updated 5 years ago
dspavankumar / keras-kaldi
View on GitHub
Keras Interface for Kaldi ASR
☆122Sep 27, 2017Updated 8 years ago
MiniXC / phones
View on GitHub
A collection of utilities for handling IPA phones.
☆27Sep 24, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
idiap / icassp-oov-recognition
View on GitHub
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
☆17Nov 28, 2021Updated 4 years ago
zcf28 / StyleGAN-VC
View on GitHub
Voice Conversion method based on speaker style
☆14Aug 7, 2021Updated 4 years ago
robflynnyh / long-context-asr
View on GitHub
Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆11Jul 3, 2026Updated 3 weeks ago
idiap / pkwrap
View on GitHub
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
☆74Jun 8, 2022Updated 4 years ago
shahruk10 / kaldi-tflite
View on GitHub
Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and …
☆20Oct 6, 2022Updated 3 years ago
shivamsaboo17 / PySNIP
View on GitHub
Single shot neural network pruning before training the model, based on connection sensitivity
☆11Aug 7, 2019Updated 6 years ago
fedderrico / ubm_map_diarization
View on GitHub
Speaker diarization with GMM-UBM and MAP Adaptation
☆31Sep 13, 2018Updated 7 years ago
zw76859420 / ASR_Phone
View on GitHub
以音素建模构建NN-CTC声学模型
☆16May 14, 2019Updated 7 years ago
stefan-it / ukrainian-electra
View on GitHub
Ukrainian ELECTRA model
☆12Mar 11, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ftyers / commonvoice-utils
View on GitHub
Linguistic processing for Common Voice
☆59Jan 18, 2024Updated 2 years ago
MLSpeech / Dr.VOT
View on GitHub
Dr.VOT is an a software package for automatic measurement of voice onset time (VOT).
☆33Jul 25, 2023Updated 3 years ago
craffel / mocha
View on GitHub
Example implementation of Monotonic Chunkwise Attention.
☆54Feb 23, 2018Updated 8 years ago
SilvrDuck / AccentedSpeechRecognition
View on GitHub
Experiments on speech recognition robustness to accents and dialects
☆12Apr 2, 2019Updated 7 years ago
lazear / simd-euclidean
View on GitHub
Calculation of euclidean distance between vectors, with SIMD
☆13Jan 17, 2024Updated 2 years ago
holm-aune-bachelor2018 / ctc
View on GitHub
Speech recognition with CTC in Keras with Tensorflow backend
☆31Mar 24, 2023Updated 3 years ago
ynop / audiomate
View on GitHub
Python library for handling audio datasets.
☆139Jul 6, 2023Updated 3 years ago
dopefishh / praatalign
View on GitHub
A Praat plug-in for performing interactive phonetic forced alignment
☆29Sep 22, 2018Updated 7 years ago
xingchensong / Speech-Transformer-tf2.0
View on GitHub
transformer for ASR-systerm (via tensorflow2.0)
☆114May 7, 2019Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
luohongyin / bdfa-torch
View on GitHub
Training neural networks with back-prop, feedback-alignment and direct feedback-alignment
☆11Mar 20, 2017Updated 9 years ago
WxxShirley / KDD2024ProCom
View on GitHub
Codes and data for KDD 2024 Research Track paper "ProCom: A Few-shot Targeted Community Detection Algorithm"
☆11Aug 15, 2024Updated last year
aalto-speech / interspeech2019_karhila_et_al
View on GitHub
Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…
☆25May 6, 2019Updated 7 years ago
vocaliodmiku / wav2vec2mdd-Text
View on GitHub
☆19Jun 28, 2022Updated 4 years ago
bond005 / vad
View on GitHub
Various algorithms for voice activity detection
☆22Jan 31, 2017Updated 9 years ago
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 5 months ago
gbegus / articulationGAN
View on GitHub
☆24Sep 1, 2023Updated 2 years ago