proger / haloop
Agent toolkit for 100 hours of speech and 10 GiB of text
☆13Updated last year
Alternatives and similar repositories for haloop:
Users that are interested in haloop are comparing it to the libraries listed below
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆30Updated 2 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- Фонограми та синтагми: інструменти обробки☆21Updated 2 weeks ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆21Updated last year
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆63Updated last year
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆22Updated last month
- Convert English text from written expressions into spoken forms☆25Updated 2 years ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- ☆56Updated 2 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Updated 2 years ago
- Deep Speech Distances PyTorch☆28Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆35Updated 9 months ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆16Updated 3 years ago
- Unofficial implementation of wavenext vocoder☆44Updated 7 months ago
- ☆15Updated 2 years ago
- ☆20Updated this week
- Articulatory features estimation using Listen Attend and Spell architecture.☆32Updated 4 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- ☆35Updated last month
- ☆16Updated last year
- scipts for working with open.bible data☆24Updated 3 years ago
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models☆51Updated last month
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆41Updated last year
- phone inventory library☆16Updated last year
- ☆11Updated 3 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆50Updated 8 months ago
- ☆17Updated 2 years ago