☆27Mar 29, 2021Updated 5 years ago
Alternatives and similar repositories for xlsr-wav2vec2-phoneme-recognition
Users that are interested in xlsr-wav2vec2-phoneme-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆29Oct 10, 2019Updated 6 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆38Jan 23, 2024Updated 2 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Mar 20, 2021Updated 5 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25May 6, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Tensorflow implementation of VQVAE for voice conversion☆12Apr 3, 2018Updated 8 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- ☆18Apr 12, 2021Updated 5 years ago
- Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The …☆20May 24, 2023Updated 3 years ago
- 日本音響学会誌用BibTeXスタイルファイル☆11Jan 24, 2022Updated 4 years ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆16Oct 27, 2023Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆34May 18, 2022Updated 4 years ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is an AI model using SAM and Grounding DINO to segment objects in a floor plan and effectively remove them in order to get a clean a…☆15Mar 30, 2024Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆52Dec 7, 2021Updated 4 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆48Dec 27, 2022Updated 3 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆106Sep 3, 2021Updated 4 years ago
- VQVAE for Unsupervised Voice Conversion☆21Apr 25, 2019Updated 7 years ago
- ☆14Aug 19, 2024Updated last year
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆37Updated this week
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Nov 14, 2019Updated 6 years ago
- Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.☆49Dec 1, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- feature extraction from speech signals☆395Jun 10, 2026Updated last week
- Deep semantic role labeling using Tensorflow☆17Sep 30, 2018Updated 7 years ago
- Aim to implement a classifier which classifies an audio sample into speech or music.☆10Sep 17, 2019Updated 6 years ago
- ☆16Mar 19, 2021Updated 5 years ago
- 贵州大学研究生学位论文模板☆12Apr 29, 2026Updated last month
- fine-tune Whipser model for Taiwanese speech recognition☆37Mar 23, 2023Updated 3 years ago
- Magface Triton Inferece Server Using Tensorrt☆18Feb 12, 2022Updated 4 years ago
- ERB representation of an audio file implemented in Python☆27Oct 21, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Nov 5, 2021Updated 4 years ago
- ☆12Jun 1, 2024Updated 2 years ago
- A repository of Japanese Phoneme-Level BERT☆24Dec 16, 2023Updated 2 years ago
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)☆75Feb 28, 2024Updated 2 years ago
- Leveraging BERT to Improve Spoken Language Identification☆18Nov 22, 2022Updated 3 years ago
- Quickly delete all of your PSN friends☆11Mar 16, 2024Updated 2 years ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆100Nov 20, 2024Updated last year