proger / haloop
Agent toolkit for 100 hours of speech and 10 GiB of text
☆13Updated last year
Alternatives and similar repositories for haloop
Users that are interested in haloop are comparing it to the libraries listed below
Sorting:
- A collection of utilities for handling IPA phones.☆25Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆51Updated 9 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆30Updated 2 years ago
- phone inventory library☆16Updated last year
- A handy dataset of noises for ASR☆21Updated 5 years ago
- ☆20Updated 6 years ago
- ☆22Updated 3 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆22Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆22Updated 2 months ago
- ☆36Updated 2 weeks ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆31Updated 2 weeks ago
- ☆13Updated 3 years ago
- Viterbi decoding in PyTorch☆32Updated last month
- Unofficial implementation of wavenext vocoder☆45Updated 8 months ago
- ☆17Updated 2 years ago
- ☆62Updated last year
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated last year
- ☆56Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆64Updated last year
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆16Updated 3 years ago
- ☆15Updated 2 years ago
- ☆16Updated 2 years ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆17Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Updated 4 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆22Updated 8 months ago
- NVIDIA's FastPitch, extracted from the DeepLearningExamples repository☆13Updated last year
- ☆21Updated 5 years ago