kosuke-kitahara / xlsr-wav2vec2-phoneme-recognitionView external linksLinks
☆27Mar 29, 2021Updated 4 years ago
Alternatives and similar repositories for xlsr-wav2vec2-phoneme-recognition
Users that are interested in xlsr-wav2vec2-phoneme-recognition are comparing it to the libraries listed below
Sorting:
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25May 6, 2019Updated 6 years ago
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆29Oct 10, 2019Updated 6 years ago
- Deep Learning model for lexical stress detection in spoken English☆29Mar 17, 2020Updated 5 years ago
- This is an AI model using SAM and Grounding DINO to segment objects in a floor plan and effectively remove them in order to get a clean a…☆14Mar 30, 2024Updated last year
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆34Jan 23, 2024Updated 2 years ago
- ☆14Aug 19, 2024Updated last year
- This is a repository dedicated for pre-trained acoustic models of Hong Kong Cantonese and Cantonese forced alignment.☆21Nov 14, 2024Updated last year
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆257May 9, 2022Updated 3 years ago
- Magface Triton Inferece Server Using Tensorrt☆18Feb 12, 2022Updated 4 years ago
- The official code of our paper at EMNLP 2022: Back to the Future: Bidirectional Information Decoupling Network for Multi-turn Dialogue Mo…☆16Feb 17, 2023Updated 3 years ago
- Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The …☆20May 24, 2023Updated 2 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- Breaks a word into syllables using an LSTM-based neural network.☆20Aug 14, 2023Updated 2 years ago
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Nov 14, 2019Updated 6 years ago
- ☆25Jul 10, 2023Updated 2 years ago
- Workflow for forced alignment between languages☆23Jan 13, 2026Updated last month
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆48May 6, 2024Updated last year
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆11Jun 28, 2022Updated 3 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Mar 14, 2025Updated 11 months ago
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)☆76Feb 28, 2024Updated last year
- ☆14May 14, 2025Updated 9 months ago
- Attention-based Hybrid CNN-LSTM and Spectral Data Augmentation for COVID-19 Diagnosis from Cough Sound☆36Aug 31, 2022Updated 3 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33May 18, 2022Updated 3 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Mar 20, 2021Updated 4 years ago
- 《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm☆82Oct 19, 2023Updated 2 years ago
- Cochlear implant signal processing☆10Jun 24, 2021Updated 4 years ago
- La-O-Dan iOS study☆19Oct 11, 2011Updated 14 years ago
- Tally Prime MCP (Model Context Protocol) Server implementation to feed Tally ERP data to popular LLM like Claude, ChatGPT supporting MCP☆15Nov 11, 2025Updated 3 months ago
- This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction…☆11Mar 20, 2023Updated 2 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- ☆10Jul 29, 2022Updated 3 years ago
- ☆11Sep 4, 2023Updated 2 years ago
- Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.☆45Dec 1, 2025Updated 2 months ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Updated this week
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated last month
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆38Apr 29, 2024Updated last year