fengxin-bupt/Application-of-Word2vec-in-Phoneme-Recognition

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fengxin-bupt/Application-of-Word2vec-in-Phoneme-Recognition)

fengxin-bupt / Application-of-Word2vec-in-Phoneme-Recognition

Build an attention-based model for speech recogntion.Use the Word2vec model to help to train the attention model.

☆30

Alternatives and similar repositories for Application-of-Word2vec-in-Phoneme-Recognition

Users that are interested in Application-of-Word2vec-in-Phoneme-Recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JoergFranke / phoneme_recognition
View on GitHub
Phoneme Recognition using RecNet
☆97Nov 22, 2016Updated 9 years ago
HanSeokhyeon / Deep_learning_for_Phoneme_recognition
View on GitHub
다양한 feature와 deep learning을 이용한 Phoneme Recognition입니다.
☆13Nov 27, 2019Updated 6 years ago
tbornt / phoneme_ctc
View on GitHub
Bidirectional dynamic RNN + CTC for phoneme recognition
☆47Jun 24, 2020Updated 6 years ago
AppleHolic / PytorchSR
View on GitHub
Pytorch based phoneme recognition (TIMIT phoneme classification)
☆35Apr 25, 2018Updated 8 years ago
getalp / mass-dataset
View on GitHub
MaSS - Multilingual corpus of Sentence-aligned Spoken utterances
☆50Sep 16, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ArnaudFickinger / Attention_VAE
View on GitHub
VAE with Attention Mechanism for a more powerful representation of interactions
☆21Jun 29, 2019Updated 7 years ago
felixkreuk / SegFeat
View on GitHub
Phoneme Boundary Detection using Learnable Segmental Features (ICASSP 2020)
☆83Nov 13, 2021Updated 4 years ago
LEEYOONHYUNG / GraphTTS
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
nusnlp / greco
View on GitHub
The official code for the "System Combination via Quality Estimation for Grammatical Error Correction" paper, published in EMNLP 2023.
☆16Jan 24, 2026Updated 5 months ago
1ytic / warp-rna
View on GitHub
Recurrent Neural Aligner
☆51Apr 14, 2020Updated 6 years ago
zw76859420 / ASR_Phone
View on GitHub
以音素建模构建NN-CTC声学模型
☆16May 14, 2019Updated 7 years ago
bottlecapper / EmoMUNIT
View on GitHub
Emotional Speech Conversion using Style Transfer and MUNIT
☆37Apr 17, 2019Updated 7 years ago
chorowski-lab / CPC_audio
View on GitHub
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆10Feb 22, 2022Updated 4 years ago
upskyy / Transformer-Transducer
View on GitHub
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆114Feb 27, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Fraunhofer-AISEC / towards-resistant-audio-adversarial-examples
View on GitHub
Generation tool for offset-resistant audio adversarial examples against Deepspeech
☆10Oct 5, 2020Updated 5 years ago
apple / ml-omni-router-moe-asr
View on GitHub
☆18Oct 24, 2025Updated 8 months ago
wiebket / bt4vt
View on GitHub
Bias Tests for Voice Technologies (bt4vt)
☆11Jun 16, 2024Updated 2 years ago
csalt-research / accented-codebooks-asr
View on GitHub
☆19Sep 10, 2024Updated last year
Topaz1618 / CycleganSA
View on GitHub
💻 🐈 Added a self-attention layer to the CycleGAN implementation (PyTorch).
☆13May 31, 2024Updated 2 years ago
AlexeySorokin / EditScorer
View on GitHub
The code for EMNLP2022 paper "Improved grammatical error correction by ranking elementary edits"
☆21Dec 14, 2022Updated 3 years ago
vocaliodmiku / wav2vec2mdd-Text
View on GitHub
☆19Jun 28, 2022Updated 4 years ago
ZihaoZhao / Pytorch-ASR-WaveNet
View on GitHub
A Pytorch implementation of WaveNet ASR (Automatic Speech Recognition)
☆13Sep 22, 2021Updated 4 years ago
bottlecapper / EmoCycleGAN
View on GitHub
Emotional Speech Conversion using Nonparallel Data
☆17Apr 10, 2019Updated 7 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
uark-cviu / Right2Talk
View on GitHub
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20Aug 2, 2021Updated 4 years ago
AI-secure / Characterizing-Audio-Adversarial-Examples-using-Temporal-Dependency
View on GitHub
ICLR 2019 Paper, "Characterizing Audio Adversarial Examples using Temporal Dependency".
☆11Apr 3, 2019Updated 7 years ago
ZhengkunTian / rnn-transducer
View on GitHub
A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition
☆239May 12, 2020Updated 6 years ago
ryuzho / DiffVC
View on GitHub
Diffusion Model for Voice Conversion
☆17Oct 11, 2022Updated 3 years ago
mmmmayi / ExPO
View on GitHub
official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification
☆14Mar 14, 2025Updated last year
felixkreuk / UnsupSeg
View on GitHub
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
☆146Aug 5, 2022Updated 3 years ago
Deepest-Project / Transformer-TTS
View on GitHub
Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
☆64Jul 6, 2023Updated 3 years ago
hubertsiuzdak / voice-conversion
View on GitHub
Voice conversion using deep adversarial learning
☆17Oct 29, 2021Updated 4 years ago
xuchennlp / S2T
View on GitHub
The project for speech translation
☆12Sep 28, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
iamyuanchung / Autoregressive-Predictive-Coding
View on GitHub
Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
☆191Jan 29, 2020Updated 6 years ago
glory20h / FitHuBERT
View on GitHub
FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)
☆19Nov 15, 2023Updated 2 years ago
Devil4ngle / SquadMortarOverlay
View on GitHub
Squad Mortar Overlay: Overlays Squad map with the SquadCalc map
☆13Jun 14, 2026Updated last month
JaesungBae / Speech-Command-Recognition-with-Capsule-Network
View on GitHub
Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.
☆25Jan 28, 2019Updated 7 years ago
zqs01 / data2vecnoisy
View on GitHub
☆11Oct 20, 2022Updated 3 years ago
mravanelli / pytorch_MLP_for_ASR
View on GitHub
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…
☆40Feb 10, 2018Updated 8 years ago
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 4 years ago