telecombcn-dl / labs-allLinks
Labs for deep learning courses at UPC ETSETB TelecomBCN.
☆17Updated 2 weeks ago
Alternatives and similar repositories for labs-all
Users that are interested in labs-all are comparing it to the libraries listed below
Sorting:
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Updated last year
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Updated 5 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆91Updated last year
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39Updated last year
- Repository for multilingual speech data resources for native languages of Zambia.☆19Updated last year
- Phonetically-Oriented Word Error Rate☆36Updated 6 years ago
- ☆30Updated 3 years ago
- VB Diarization with Eigenvoice and HMM Priors, refactored☆15Updated 4 years ago
- Deep Articulatory Synthesis and Inversion☆54Updated last year
- ☆34Updated last month
- The VoxTube dataset official repository☆71Updated last year
- Balanced Error Rate for Speaker Diarization☆33Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆45Updated 2 years ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆120Updated last year
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆23Updated 9 months ago
- EVAR ~ Evaluation package for Audio Representations☆70Updated 2 weeks ago
- ☆19Updated 3 years ago
- Simple Python package for fast DER computation☆35Updated 2 years ago
- ☆10Updated 2 years ago
- Script to perform statistical significance test between ASR hypotheses.☆22Updated 8 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference☆51Updated 6 years ago
- Pytorch port of Google Research's LEAF Audio paper☆93Updated 4 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- The official repository for Audio ALBERT☆67Updated 3 years ago
- NIST SPH File reader (e.g. for TEDLIUM Corpus)☆26Updated 5 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 3 years ago
- A merged version of multiple open-source German speech datasets.☆33Updated last year
- ☆54Updated 2 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆77Updated 3 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 4 years ago