Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large language model
☆55May 6, 2024Updated 2 years ago
Alternatives and similar repositories for mispronunciation-detection-diagnosis-wav2vec2-and-llm
Users that are interested in mispronunciation-detection-diagnosis-wav2vec2-and-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…☆26Jun 25, 2019Updated 6 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆35Jan 23, 2024Updated 2 years ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Oct 5, 2022Updated 3 years ago
- ☆28Nov 7, 2023Updated 2 years ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆208Feb 13, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆19Jun 28, 2022Updated 3 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- A pipeline architecture for temporal segmentation of video lectures.☆12Sep 8, 2020Updated 5 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated 2 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…☆12Jan 24, 2024Updated 2 years ago
- ☆19Sep 5, 2024Updated last year
- Repositório do mini curso sobre noSQL na semana da computação da UFJF 2021☆11Nov 11, 2021Updated 4 years ago
- ☆25Jun 14, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 3 years ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆13Oct 31, 2024Updated last year
- Matlaba and Python Solutions on machine learnign coursera on Coursera by Andrew Ng☆10Jun 23, 2018Updated 7 years ago
- ☆16Jun 13, 2024Updated last year
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Nov 14, 2023Updated 2 years ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- Accompany code to reproduce the baselines of the International Multimodal Sentiment Analysis Challenge (MuSe 2020).☆16Dec 8, 2022Updated 3 years ago
- ☆27Mar 29, 2021Updated 5 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for Information Fusion 2025 Paper "Multi-Source Multi-Modal Domain Adaptation"☆20Feb 4, 2025Updated last year
- Layer-wise analysis of self-supervised pre-trained speech representations☆133Oct 18, 2024Updated last year
- 贵州大学研究生学位论文模板☆10Apr 29, 2026Updated last week
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- Voice Conversion method based on speaker style☆14Aug 7, 2021Updated 4 years ago
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022)☆14Jun 23, 2022Updated 3 years ago
- 2020年互联网+方言转换☆14Nov 2, 2020Updated 5 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- ☆58Dec 2, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆27Oct 9, 2024Updated last year
- Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"☆88Jun 10, 2024Updated last year
- ☆18Apr 12, 2021Updated 5 years ago
- Hierarchical Universal Modular ANotator☆12Apr 21, 2026Updated 2 weeks ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Parses a document (scanned or phone captured) and returns the underlying question - answer layout structured capture by LayoutXLM model☆10Jun 14, 2021Updated 4 years ago
- Audio tokenization, in the fastest way possible!☆54Aug 26, 2024Updated last year