telecombcn-dl / labs-all
Labs for deep learning courses at UPC ETSETB TelecomBCN.
☆15Updated 3 weeks ago
Alternatives and similar repositories for labs-all:
Users that are interested in labs-all are comparing it to the libraries listed below
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Updated 3 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆13Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Updated 3 years ago
- This repository implements SummaryMixing, a simpler, faster and much cheaper replacement to self-attention for automatic speech recogniti…☆113Updated 4 months ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆8Updated 2 years ago
- ☆8Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆38Updated 2 years ago
- The official repository for Audio ALBERT☆64Updated 2 years ago
- PyTorch implementation of automatic speech recognition models.☆38Updated 4 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆24Updated last year
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).☆42Updated 2 years ago
- An adaptation of Fairseq to (End-to-end) speech translation.☆22Updated 2 years ago
- Phonetically-Oriented Word Error Rate☆33Updated 5 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 2 years ago
- Example code for a neural transducer model.☆61Updated 11 months ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- ☆16Updated 5 years ago
- ☆12Updated 3 years ago
- VIsually-Pivoted Audio and(N) Text☆22Updated 2 years ago
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).☆15Updated 3 years ago
- Minimal implementation of Contrastive Predictive Coding for audio.☆16Updated 5 years ago
- ☆47Updated 2 years ago
- SubER - Subtitle Edit Rate☆22Updated 5 months ago
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated last week