telecombcn-dl / labs-allLinks
Labs for deep learning courses at UPC ETSETB TelecomBCN.
☆16Updated last week
Alternatives and similar repositories for labs-all
Users that are interested in labs-all are comparing it to the libraries listed below
Sorting:
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Updated last year
- ☆32Updated this week
- Phonetically-Oriented Word Error Rate☆36Updated 6 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆88Updated last year
- ☆10Updated 2 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Updated 4 years ago
- Example code for a neural transducer model.☆66Updated last year
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Updated 2 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Updated 3 years ago
- Listen Attend and Spell (LAS) implement in pytorch☆60Updated 7 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆40Updated 2 years ago
- VB Diarization with Eigenvoice and HMM Priors, refactored☆15Updated 4 years ago
- ☆30Updated 3 years ago
- EVAR ~ Evaluation package for Audio Representations☆64Updated last month
- Layer-wise analysis of self-supervised pre-trained speech representations☆116Updated 11 months ago
- Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.☆85Updated 2 months ago
- NIST SPH File reader (e.g. for TEDLIUM Corpus)☆26Updated 5 years ago
- Evaluation kit for the HEAR Benchmark☆60Updated this week
- Wav2Vec for speech recognition, classification, and audio classification☆266Updated 3 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 3 years ago
- Raw waveform adaptation with SincNet☆12Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆43Updated 2 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference☆51Updated 5 years ago
- The VoxTube dataset official repository☆70Updated last year
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Updated 4 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆23Updated 7 months ago
- Python toolkit for speech processing☆71Updated last month
- Toolkit for downloading and processing Google's AudioSet dataset.☆170Updated last month
- Repository for speech paper reading☆33Updated 4 years ago