Mispronunciation detection using a pretrained and fine-tuned wav2vec2 model for phoneme recognition, with diagnosis and feedback from a large language model
☆57May 6, 2024Updated 2 years ago
Alternatives and similar repositories for mispronunciation-detection-diagnosis-wav2vec2-and-llm
Users that are interested in mispronunciation-detection-diagnosis-wav2vec2-and-llm are comparing it to the libraries listed below.
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…☆26Jun 25, 2019Updated 6 years ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆38Jan 23, 2024Updated 2 years ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Oct 5, 2022Updated 3 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆54Nov 17, 2021Updated 4 years ago
- ☆28Nov 7, 2023Updated 2 years ago
- ☆19Jun 28, 2022Updated 3 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix (work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- ☆51Apr 12, 2024Updated 2 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated 2 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…☆12Jan 24, 2024Updated 2 years ago
- A study of the downstream instability of word embeddings☆12Aug 23, 2022Updated 3 years ago
- Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The …☆20May 24, 2023Updated 3 years ago
- ☆25Jun 14, 2022Updated 4 years ago
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 3 years ago
- MATLAB and Python solutions for the Machine Learning course on Coursera by Andrew Ng☆10Jun 23, 2018Updated 7 years ago
- ☆16Jun 13, 2024Updated 2 years ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Nov 14, 2023Updated 2 years ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- Aim to implement a classifier which classifies an audio sample into speech or music.☆10Sep 17, 2019Updated 6 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆38Dec 5, 2023Updated 2 years ago
- ☆27Mar 29, 2021Updated 5 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆134Oct 18, 2024Updated last year
- Guizhou University graduate thesis template☆12Apr 29, 2026Updated last month
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- Voice Conversion method based on speaker style☆14Aug 7, 2021Updated 4 years ago
- 2020 Internet+ dialect conversion☆14Nov 2, 2020Updated 5 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"☆89Jun 10, 2024Updated 2 years ago
- Hierarchical Universal Modular ANotator☆12May 9, 2026Updated last month
- Leveraging BERT to Improve Spoken Language Identification☆18Nov 22, 2022Updated 3 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and quickly.☆16Jun 17, 2022Updated 4 years ago
- Parses a document (scanned or phone captured) and returns the underlying question - answer layout structured capture by LayoutXLM model☆10Jun 14, 2021Updated 5 years ago
- Code for the paper "Transcribing Human Piano Performances into Music Notation" accepted at ISMIR 2016☆16Jan 24, 2018Updated 8 years ago
- Audio tokenization, in the fastest way possible!☆54Aug 26, 2024Updated last year
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization☆14Dec 21, 2024Updated last year
- A Python Toolbox for Sonifying Music Annotations and Feature Representations☆26Mar 24, 2025Updated last year
- Acoustic distance measure for comparing pronunciations☆17Aug 2, 2022Updated 3 years ago