Speech2vec pre-trained word vectors
☆76Sep 8, 2018Updated 7 years ago
Alternatives and similar repositories for speech2vec-pretrained-vectors
Users that are interested in speech2vec-pretrained-vectors are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Theano implementation of Sequence-to-Sequence Autoencoder☆13Jun 1, 2018Updated 7 years ago
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- ☆12Mar 12, 2023Updated 3 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆191Jan 29, 2020Updated 6 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Python package for the Zero Speech Challenge 2020☆14Feb 5, 2021Updated 5 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- YoloV6 for a bare Raspberry Pi using ncnn.☆11Jun 12, 2024Updated last year
- Non-Autoregressive Predictive Coding☆51Nov 3, 2020Updated 5 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- ☆11Feb 17, 2017Updated 9 years ago
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation☆229Apr 26, 2023Updated 2 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Jun 6, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆27Dec 4, 2023Updated 2 years ago
- Pre-trained models for Honk☆11Apr 1, 2019Updated 7 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated last month
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…☆63May 13, 2020Updated 5 years ago
- ☆16Sep 12, 2019Updated 6 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- Learning associations between human faces and voices☆12Feb 15, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Dec 26, 2019Updated 6 years ago
- ☆15May 26, 2021Updated 4 years ago
- Unsupervised spoken sentence embeddings☆14Dec 14, 2022Updated 3 years ago
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 2 years ago
- [NeurIPS 2022] disentanglement evaluation robust to model dimension variance.☆10Sep 21, 2022Updated 3 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference☆30May 28, 2020Updated 5 years ago
- ☆10Dec 21, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆410Sep 30, 2024Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Aug 15, 2019Updated 6 years ago
- DNN and RCED speech enhancement☆20Jan 30, 2024Updated 2 years ago
- Generate vector embeddings for music☆18Nov 7, 2017Updated 8 years ago