☆27Mar 29, 2021Updated 5 years ago
Alternatives and similar repositories for xlsr-wav2vec2-phoneme-recognition
Users that are interested in xlsr-wav2vec2-phoneme-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆35Jan 23, 2024Updated 2 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25May 6, 2019Updated 6 years ago
- Deep Learning model for lexical stress detection in spoken English☆28Mar 17, 2020Updated 6 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen…☆261May 9, 2022Updated 3 years ago
- ☆11Sep 29, 2020Updated 5 years ago
- ☆18Apr 12, 2021Updated 5 years ago
- Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The …☆20May 24, 2023Updated 2 years ago
- 日本音響学会誌用BibTeXスタイルファイル☆11Jan 24, 2022Updated 4 years ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆15Oct 27, 2023Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33May 18, 2022Updated 3 years ago
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 3 years ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is an AI model using SAM and Grounding DINO to segment objects in a floor plan and effectively remove them in order to get a clean a…☆15Mar 30, 2024Updated 2 years ago
- Convert CoNLL output of a dependency parser into a latex or graphviz tree☆12Mar 26, 2020Updated 6 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆46Dec 27, 2022Updated 3 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Sep 3, 2021Updated 4 years ago
- VQVAE for Unsupervised Voice Conversion☆21Apr 25, 2019Updated 6 years ago
- Particle Swarm Optimization (PSO) code in Python, using Matplotlib and Numpy.☆11Apr 20, 2016Updated 9 years ago
- ☆14Aug 19, 2024Updated last year
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Mar 23, 2026Updated 3 weeks ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆52May 6, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Nov 14, 2019Updated 6 years ago
- Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.☆49Dec 1, 2025Updated 4 months ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- feature extraction from speech signals☆393Jun 15, 2025Updated 10 months ago
- A pronunciation trainer w/ Python.☆14Sep 28, 2025Updated 6 months ago
- Aim to implement a classifier which classifies an audio sample into speech or music.☆10Sep 17, 2019Updated 6 years ago
- Deep semantic role labeling using Tensorflow☆17Sep 30, 2018Updated 7 years ago
- ☆16Mar 19, 2021Updated 5 years ago
- Implementation of Baseline for Scene Text-to-Scene Text Translation☆19Mar 30, 2025Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- Magface Triton Inferece Server Using Tensorrt☆18Feb 12, 2022Updated 4 years ago
- ERB representation of an audio file implemented in Python☆27Oct 21, 2018Updated 7 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- A Framework for Symbolic MUsic Graph Explanations☆10Jul 30, 2025Updated 8 months ago
- Adversarial Discriminative Domain Adaptation in Chainer☆23Nov 20, 2017Updated 8 years ago
- ☆12Jun 1, 2024Updated last year