Build an attention-based model for speech recognition. Use the Word2vec model to help train the attention model.
☆30Dec 18, 2019Updated 6 years ago
Alternatives and similar repositories for Application-of-Word2vec-in-Phoneme-Recognition
Users who are interested in Application-of-Word2vec-in-Phoneme-Recognition are comparing it to the libraries listed below.
- Phoneme Recognition using RecNet☆97Nov 22, 2016Updated 9 years ago
- Phoneme recognition using various features and deep learning.☆13Nov 27, 2019Updated 6 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆47Jun 24, 2020Updated 5 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆35Apr 25, 2018Updated 8 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Sep 16, 2024Updated last year
- VAE with Attention Mechanism for a more powerful representation of interactions☆21Jun 29, 2019Updated 6 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)☆83Nov 13, 2021Updated 4 years ago
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 6 years ago
- Emotional Speech Conversion using Style Transfer and MUNIT☆37Apr 17, 2019Updated 7 years ago
- The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits"☆21Dec 14, 2022Updated 3 years ago
- Generation tool for offset-resistant audio adversarial examples against Deepspeech☆10Oct 5, 2020Updated 5 years ago
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆114Feb 27, 2022Updated 4 years ago
- Bias Tests for Voice Technologies (bt4vt)☆11Jun 16, 2024Updated last year
- pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020☆30Jul 6, 2023Updated 2 years ago
- Building an NN-CTC acoustic model with phoneme-level modeling☆15May 14, 2019Updated 7 years ago
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow.☆27Oct 30, 2018Updated 7 years ago
- 💻 🐈 Added a self-attention layer to the CycleGAN implementation (PyTorch).☆13May 31, 2024Updated last year
- ☆19Jun 28, 2022Updated 3 years ago
- Music IR Library for Python☆13Nov 18, 2015Updated 10 years ago
- A Pytorch implementation of WaveNet ASR (Automatic Speech Recognition)☆13Sep 22, 2021Updated 4 years ago
- Emotional Speech Conversion using Nonparallel Data☆17Apr 10, 2019Updated 7 years ago
- A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popu…☆19Jan 18, 2018Updated 8 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- ☆19Sep 10, 2024Updated last year
- A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition☆239May 12, 2020Updated 6 years ago
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆12Dec 19, 2025Updated 5 months ago
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated last year
- Python interface for forced audio alignment using HTK and SoX☆350Jun 28, 2020Updated 5 years ago
- Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)☆146Aug 5, 2022Updated 3 years ago
- A repository of ELL models☆21Jan 16, 2026Updated 4 months ago
- Code for USENIX Security 2023 Paper "Every Vote Counts: Ranking-Based Training of Federated Learning to Resist Poisoning Attacks"☆19May 19, 2024Updated 2 years ago
- This repository contains all my ML Kit projects using Flutter.☆14Oct 10, 2022Updated 3 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Jul 6, 2023Updated 2 years ago
- FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)☆19Nov 15, 2023Updated 2 years ago
- Voice conversion using deep adversarial learning☆17Oct 29, 2021Updated 4 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- A Tensorflow SqueezeNet implementation☆14Oct 1, 2018Updated 7 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Jan 28, 2019Updated 7 years ago