☆27Mar 29, 2021Updated 5 years ago
Alternatives and similar repositories for xlsr-wav2vec2-phoneme-recognition
Users that are interested in xlsr-wav2vec2-phoneme-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆29Oct 10, 2019Updated 6 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆35Jan 23, 2024Updated 2 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Mar 20, 2021Updated 5 years ago
- Deep Learning model for lexical stress detection in spoken English☆28Mar 17, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25May 6, 2019Updated 7 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆262May 9, 2022Updated 4 years ago
- ☆11Sep 29, 2020Updated 5 years ago
- ☆18Apr 12, 2021Updated 5 years ago
- Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The …☆20May 24, 2023Updated 2 years ago
- 日本音響学会誌用BibTeXスタイルファイル☆11Jan 24, 2022Updated 4 years ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆15Oct 27, 2023Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33May 18, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- This is an AI model using SAM and Grounding DINO to segment objects in a floor plan and effectively remove them in order to get a clean a…☆15Mar 30, 2024Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆52Dec 7, 2021Updated 4 years ago
- Convert CoNLL output of a dependency parser into a latex or graphviz tree☆12Mar 26, 2020Updated 6 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆46Dec 27, 2022Updated 3 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Sep 3, 2021Updated 4 years ago
- VQVAE for Unsupervised Voice Conversion☆21Apr 25, 2019Updated 7 years ago
- ☆14Aug 19, 2024Updated last year
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Apr 29, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆55May 6, 2024Updated 2 years ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Nov 14, 2019Updated 6 years ago
- Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.☆49Dec 1, 2025Updated 5 months ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- feature extraction from speech signals☆393Jun 15, 2025Updated 10 months ago
- A pronunciation trainer w/ Python.☆15Sep 28, 2025Updated 7 months ago
- Deep semantic role labeling using Tensorflow☆17Sep 30, 2018Updated 7 years ago
- Aim to implement a classifier which classifies an audio sample into speech or music.☆10Sep 17, 2019Updated 6 years ago
- ☆16Mar 19, 2021Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Implementation of Baseline for Scene Text-to-Scene Text Translation☆19Mar 30, 2025Updated last year
- 贵州大学研究生学位论文模板☆10Apr 29, 2026Updated last week
- ☆10May 8, 2023Updated 3 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- Magface Triton Inferece Server Using Tensorrt☆18Feb 12, 2022Updated 4 years ago
- ERB representation of an audio file implemented in Python☆27Oct 21, 2018Updated 7 years ago
- ☆11Nov 5, 2021Updated 4 years ago