proger / haloop
Agent toolkit for 100 hours of speech and 10 GiB of text
☆13Updated last year
Alternatives and similar repositories for haloop:
Users that are interested in haloop are comparing it to the libraries listed below
- phone inventory library☆16Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆27Updated last year
- A collection of utilities for handling IPA phones.☆25Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated this week
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆62Updated 11 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 2 weeks ago
- Viterbi decoding in PyTorch☆27Updated 4 months ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆19Updated 3 months ago
- Фонограми та синтагми: інструменти обробки☆21Updated last month
- ☆13Updated 2 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Updated 4 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆34Updated 7 months ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆21Updated this week
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆16Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated 11 months ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆50Updated 6 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆24Updated 5 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆69Updated 2 years ago
- ☆32Updated 3 years ago
- Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.☆44Updated 5 years ago
- ☆37Updated 3 years ago
- Collection of scripts from mHuBERT-147.☆24Updated 3 months ago
- python wrapper for kaldi's native I/O☆27Updated last month
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- ☆17Updated last year
- ☆34Updated 5 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated last year