iamyuanchung/speech2vec-pretrained-vectors

Speech2vec pre-trained word vectors

☆76

Alternatives and similar repositories for speech2vec-pretrained-vectors

Users that are interested in speech2vec-pretrained-vectors are comparing it to the libraries listed below.

iamyuanchung / seq2seq-autoencoder
Theano implementation of Sequence-to-Sequence Autoencoder
☆13 · Jun 1, 2018 · Updated 8 years ago
jasonppy / FaST-VGS-Family
Transformer-based visually grounded speech models
☆19 · Sep 22, 2022 · Updated 3 years ago
rxtan2 / video-grounding-narrations
☆12 · Mar 12, 2023 · Updated 3 years ago
zerospeech / zerospeech2020
Python package for the Zero Speech Challenge 2020
☆14 · Feb 5, 2021 · Updated 5 years ago
iamyuanchung / Autoregressive-Predictive-Coding
Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
☆191 · Jan 29, 2020 · Updated 6 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12 · Mar 23, 2021 · Updated 5 years ago
Qengineering / YoloV6-ncnn-Raspberry-Pi-4
YoloV6 for a bare Raspberry Pi using ncnn.
☆11 · Jun 12, 2024 · Updated 2 years ago
Alexander-H-Liu / NPC
Non-Autoregressive Predictive Coding
☆51 · Nov 3, 2020 · Updated 5 years ago
eastonYi / Unsupervised-ASR
unsupervised ASR (mainly phone classifier) using EODM and GAN
☆12 · Oct 22, 2020 · Updated 5 years ago
awslabs / speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104 · Nov 26, 2022 · Updated 3 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12 · Jun 5, 2020 · Updated 6 years ago
nttcslab / byol-a
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
☆237 · Apr 26, 2023 · Updated 3 years ago
miguelballesteros / LSTM-punctuation
☆11 · Feb 17, 2017 · Updated 9 years ago
sarulab-speech / lightweight_spkr_anon
Lightweight speaker anonymization [IEEE SLT2021]
☆27 · Jun 6, 2022 · Updated 4 years ago
jasonppy / word-discovery
Word Discovery in Visually Grounded, Self-Supervised Speech Models
☆27 · Dec 4, 2023 · Updated 2 years ago
castorini / honk-models
Pre-trained models for Honk
☆11 · Apr 1, 2019 · Updated 7 years ago
LEEYOONHYUNG / GraphTTS
☆12 · Jul 6, 2023 · Updated 3 years ago
mfaruqui / eval-word-vectors
Easy to use scripts for evaluating word vectors on a variety of tasks.
☆119 · Mar 26, 2021 · Updated 5 years ago
karchkha / MelSpec_GPT_VQVAE
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18 · Oct 8, 2023 · Updated 2 years ago
jbeliao / SLAM
☆16 · Sep 12, 2019 · Updated 6 years ago
bagustris / s3prl-ser
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15 · Feb 28, 2026 · Updated 4 months ago
kaituoxu / X-Punctuator
A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…
☆63 · May 13, 2020 · Updated 6 years ago
espnet / icassp2020-tts
ESPnet-TTS Audio Sample HP
☆21 · Oct 25, 2019 · Updated 6 years ago
changil / facevoice
Learning associations between human faces and voices
☆12 · Feb 15, 2019 · Updated 7 years ago
ivanvovk / compressed-tacotron2-pytorch
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
☆22 · Dec 26, 2019 · Updated 6 years ago
lingjzhu / spoken_sent_embedding
Unsupervised spoken sentence embeddings
☆14 · Dec 14, 2022 · Updated 3 years ago
RapidAI / RapidPunc
A library for adding punctuation into a text from ASR.
☆19 · May 8, 2023 · Updated 3 years ago
skakouros / s3prl_attentive_correlation
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13 · Nov 18, 2022 · Updated 3 years ago
Yeongtae / tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
☆30 · May 28, 2020 · Updated 6 years ago
rosinality / melgan-pytorch
MelGAN and Tacotron 2 in PyTorch
☆11 · Oct 22, 2019 · Updated 6 years ago
noahcao / disentanglement_lib_med
[NeurIPS 2022] disentanglement evaluation robust to model dimension variance.
☆10 · Sep 21, 2022 · Updated 3 years ago
alefiury / SE-R-2022-SER-Track
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16 · Mar 28, 2023 · Updated 3 years ago
linan2 / TensorFlow-speech-enhancement
DNN and RCED speech enhancement
☆20 · Jan 30, 2024 · Updated 2 years ago
ImperialNLP / MMT-Delib
☆10 · Dec 21, 2022 · Updated 3 years ago
SuperKogito / SER-datasets
A collection of datasets for the purpose of emotion recognition/detection in speech.
☆420 · Sep 30, 2024 · Updated last year
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16 · Apr 13, 2022 · Updated 4 years ago
revsic / torch-retriever-vc
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12 · Jan 27, 2023 · Updated 3 years ago
TTS-Research / PEL-TTS
☆14 · Aug 16, 2023 · Updated 2 years ago
meelement / noise_adversarial_tacotron
Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…
☆17 · Aug 15, 2019 · Updated 6 years ago