Speech2vec pre-trained word vectors
☆76Sep 8, 2018Updated 7 years ago
Alternatives and similar repositories for speech2vec-pretrained-vectors
Users that are interested in speech2vec-pretrained-vectors are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Theano implementation of Sequence-to-Sequence Autoencoder☆13Jun 1, 2018Updated 7 years ago
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- ☆12Mar 12, 2023Updated 3 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆191Jan 29, 2020Updated 6 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Python package for the Zero Speech Challenge 2020☆14Feb 5, 2021Updated 5 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- YoloV6 for a bare Raspberry Pi using ncnn.☆11Jun 12, 2024Updated last year
- Non-Autoregressive Predictive Coding☆51Nov 3, 2020Updated 5 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- ☆11Feb 17, 2017Updated 9 years ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆231Apr 26, 2023Updated 3 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Jun 6, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆27Dec 4, 2023Updated 2 years ago
- Pre-trained models for Honk☆11Apr 1, 2019Updated 7 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Easy to use scripts for evaluating word vectors on a variety of tasks.☆119Mar 26, 2021Updated 5 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 2 months ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…☆63May 13, 2020Updated 5 years ago
- ☆16Sep 12, 2019Updated 6 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Learning associations between human faces and voices☆12Feb 15, 2019Updated 7 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Dec 26, 2019Updated 6 years ago
- ☆15May 26, 2021Updated 4 years ago
- Unsupervised spoken sentence embeddings☆14Dec 14, 2022Updated 3 years ago
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 2 years ago
- [NeurIPS 2022] disentanglement evaluation robust to model dimension variance.☆10Sep 21, 2022Updated 3 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference☆30May 28, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆10Dec 21, 2022Updated 3 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆413Sep 30, 2024Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Aug 15, 2019Updated 6 years ago
- DNN and RCED speech enhancement☆20Jan 30, 2024Updated 2 years ago