☆27Mar 29, 2021Updated 5 years ago
Alternatives and similar repositories for xlsr-wav2vec2-phoneme-recognition
Users that are interested in xlsr-wav2vec2-phoneme-recognition are comparing it to the libraries listed below.
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆29Oct 10, 2019Updated 6 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆36Jan 23, 2024Updated 2 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Mar 20, 2021Updated 5 years ago
- Deep Learning model for lexical stress detection in spoken English☆28Mar 17, 2020Updated 6 years ago
- Tensorflow implementation of VQVAE for voice conversion☆12Apr 3, 2018Updated 8 years ago
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆263May 9, 2022Updated 4 years ago
- ☆11Sep 29, 2020Updated 5 years ago
- ☆18Apr 12, 2021Updated 5 years ago
- Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The …☆20May 24, 2023Updated 3 years ago
- BibTeX style file for the Journal of the Acoustical Society of Japan☆11Jan 24, 2022Updated 4 years ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆16Oct 27, 2023Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33May 18, 2022Updated 4 years ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 3 years ago
- This is an AI model using SAM and Grounding DINO to segment objects in a floor plan and effectively remove them in order to get a clean a…☆15Mar 30, 2024Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆52Dec 7, 2021Updated 4 years ago
- Convert CoNLL output of a dependency parser into a latex or graphviz tree☆12Mar 26, 2020Updated 6 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆48Dec 27, 2022Updated 3 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆106Sep 3, 2021Updated 4 years ago
- VQVAE for Unsupervised Voice Conversion☆21Apr 25, 2019Updated 7 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- ☆14Aug 19, 2024Updated last year
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Nov 14, 2019Updated 6 years ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- ☆10Jan 29, 2019Updated 7 years ago
- feature extraction from speech signals☆395Jun 15, 2025Updated 11 months ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆55May 6, 2024Updated 2 years ago
- A pronunciation trainer w/ Python.☆15Sep 28, 2025Updated 8 months ago
- Deep semantic role labeling using Tensorflow☆17Sep 30, 2018Updated 7 years ago
- Aim to implement a classifier which classifies an audio sample into speech or music.☆10Sep 17, 2019Updated 6 years ago
- A command line program to check if DTU's Registration portal is up and running and automates the login process by solving the CAPTCHA usi…☆13Jul 25, 2024Updated last year
- This is my undergraduate design project☆13Mar 13, 2017Updated 9 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- Fine-tune Whisper model for Taiwanese speech recognition☆37Mar 23, 2023Updated 3 years ago
- ERB representation of an audio file implemented in Python☆27Oct 21, 2018Updated 7 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- A Framework for Symbolic MUsic Graph Explanations☆11Jul 30, 2025Updated 9 months ago
- Base Code for Fidenza Genuary Day 4☆12Jan 4, 2022Updated 4 years ago