Alexander-H-Liu / NPC

Non-Autoregressive Predictive Coding

☆51

Alternatives and similar repositories for NPC

Users who are interested in NPC are comparing it to the repositories listed below.

Hertin / WavPrompt
☆36 Updated 3 years ago
mechanicalsea / lighthubert
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆74 Updated 2 years ago
sky1456723 / Pytorch-MBNet
A PyTorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK
☆60 Updated 3 years ago
iamyuanchung / VQ-APC
Vector Quantized Autoregressive Predictive Coding (VQ-APC)
☆37 Updated 4 years ago
grtzsohalf / SpeechNet-codebase
☆20 Updated 4 years ago
kan-bayashi / LibriTTSLabel
Alignment files of LibriTTS.
☆64 Updated 5 years ago
MU94W / TTS-Eval
☆18 Updated 6 years ago
xinjli / alqalign
multilingual speech aligner
☆74 Updated last year
nii-yamagishilab / VCC2020-database
☆52 Updated 4 years ago
felixkreuk / UnsupSeg
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation (INTERSPEECH 2020)
☆141 Updated 2 years ago
kamperh / vqwordseg
Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.
☆37 Updated last year
HaoranMiao / streaming-attention
streaming attention networks for end-to-end automatic speech recognition
☆55 Updated 5 years ago
csukuangfj / transducer-loss-benchmarking
☆68 Updated 3 years ago
ga642381 / SpeechPrompt-v2
《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm
☆81 Updated last year
bigpon / SpeechSubjectiveTest
Speech (audio) subjective evaluation system
☆39 Updated 5 years ago
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆48 Updated 6 months ago
zerospeech / zerospeech2021_baseline
BERT and LSTM baseline models of the ZeroSpeech Challenge 2021
☆60 Updated 2 years ago
HarunoriKawano / BEST-RQ
Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in PyTorch.
☆81 Updated 2 years ago
HuangZiliAndy / SSL_for_multitalker
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆29 Updated 2 years ago
HuangZiliAndy / RPNSD
PyTorch implementation of RPNSD
☆60 Updated last year
k2-fsa / fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆142 Updated last year
Deepest-Project / AlignTTS
Implementation of AlignTTS
☆77 Updated 2 years ago
mutiann / speech_rankings
A CSRankings-like index for speech researchers
☆34 Updated 8 months ago
ga642381 / SpeechPrompt
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…
☆101 Updated 3 months ago
ga642381 / Speech-Prompts-Adapters
This repository surveys papers focusing on prompting and adapters for speech processing.
☆110 Updated last year
MarkWuNLP / SemanticMask
The repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"
☆38 Updated 5 years ago
TeaPoly / Conformer-Athena
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
☆44 Updated 2 years ago
ericwudayi / SkipVQVC
An implementation of SkipVQVC with various settings.
☆75 Updated 5 years ago
NaoyukiKanda / LibriSpeechMix
☆35 Updated 4 years ago
ankitapasad / layerwise-analysis
Layer-wise analysis of self-supervised pre-trained speech representations
☆107 Updated 8 months ago