crazycloud / mispronunciation-detection-diagnosis-wav2vec2-and-llm
Mispronunciation detection using a pretrained, fine-tuned wav2vec2 model for phoneme recognition, with diagnosis and feedback from a large language model
☆48May 6, 2024Updated last year
Alternatives and similar repositories for mispronunciation-detection-diagnosis-wav2vec2-and-llm
Users that are interested in mispronunciation-detection-diagnosis-wav2vec2-and-llm are comparing it to the libraries listed below
- ☆28Nov 7, 2023Updated 2 years ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Oct 5, 2022Updated 3 years ago
- This repository contains all the code used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat…☆26Jun 25, 2019Updated 6 years ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆34Jan 23, 2024Updated 2 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix (work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆199Feb 13, 2023Updated 3 years ago
- ☆19Jun 28, 2022Updated 3 years ago
- Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The …☆20May 24, 2023Updated 2 years ago
- Workflow for forced alignment between languages☆23Jan 13, 2026Updated last month
- ☆25Jun 14, 2022Updated 3 years ago
- Goodness of Pronunciation (GOP) for oral reading assessment.☆53Nov 17, 2021Updated 4 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆11Jun 28, 2022Updated 3 years ago
- This repo accompanies the INTERSPEECH 2024 paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC"☆35Feb 5, 2026Updated last week
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- ☆27Mar 29, 2021Updated 4 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- Tally Prime MCP (Model Context Protocol) server implementation that feeds Tally ERP data to popular MCP-capable LLMs such as Claude and ChatGPT☆15Nov 11, 2025Updated 3 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated last month
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Updated this week
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated last year
- This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction…☆11Mar 20, 2023Updated 2 years ago
- ☆10Jul 29, 2022Updated 3 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- Hierarchical Universal Modular ANotator☆11Feb 7, 2026Updated last week
- "SSPNet: An interpretable 3D-CNN for classification of schizophrenia using phase maps of resting-state complex-valued fMRI data," publish…☆10May 13, 2022Updated 3 years ago
- Scraping LegiFrance naturalisation decrees for fun and OSINT profit☆11May 27, 2023Updated 2 years ago
- Deepfake faces detection from forged videos where used explainable AI for models' robustness as well as cost sensitive methods for mitiga…☆10May 27, 2024Updated last year
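- This repository is the legal large language model project of the BUPT Intelligent Systems Lab, implemented on top of open-source large models such as ChatGLM.☆11Nov 28, 2023Updated 2 years ago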
- ImageQA is a tool for analyzing digital image quality according to specific attributes such as color, tone transfer, noise or resolution.…☆10Sep 18, 2024Updated last year
- A reddit scraping and analysis bot to visualize linguistic and content trends☆12Oct 5, 2021Updated 4 years ago
- “Welcome to my GitHub repository, a hub of exploration and innovation in the realm of data science. 📊💻 Here, you’ll find a curated coll…☆10Apr 3, 2025Updated 10 months ago
- Code/Report/Image Plagiarism finder☆12Apr 1, 2025Updated 10 months ago
- A dehazing method for remote sensing images, based on CycleGAN.☆12May 10, 2022Updated 3 years ago
- Deep metric learning: Triplet, Magnet and VMF loss☆11Aug 19, 2022Updated 3 years ago
- Generate videos with voice narration from PPT/PDF slides☆10Sep 4, 2023Updated 2 years ago
- A generative deep learning model based on GAN architecture was implemented to generate synthetic network data (benign and malicious) alik…☆10Oct 23, 2021Updated 4 years ago