Agent toolkit for 100 hours of speech and 10 GiB of text
☆14Jul 15, 2025Updated 7 months ago
Alternatives and similar repositories for haloop
Users that are interested in haloop are comparing it to the libraries listed below
- GPT-2 Metadata Pretraining Towards Instruction Finetuning for Ukrainian☆20Aug 6, 2023Updated 2 years ago
- Machine learning for control systems engineers☆11Jul 19, 2023Updated 2 years ago
- ☆15Oct 29, 2024Updated last year
- Open Source Crimean Tatar Text-to-Speech datasets☆14Feb 23, 2025Updated last year
- Evaluation of STT models for the German language☆15Jan 22, 2022Updated 4 years ago
- Use quantized versions of Whisper to speed up inference☆12Oct 16, 2024Updated last year
- Script to train a German n-gram Language Model on articles of Wikipedia☆14Oct 20, 2018Updated 7 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- Experimental repository for NER (named-entity recognition) on sentences in the Ukrainian language.☆13Aug 13, 2021Updated 4 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Dec 5, 2022Updated 3 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- Dictionary of word stresses in the Ukrainian language 🇺🇦☆22Sep 29, 2024Updated last year
- Dictionary of obscene words for Ukrainian language☆22May 15, 2025Updated 9 months ago
- MirasVoice is a data set consisting of speech samples from bilinguals to train a neural network for optimization of speaker verification algor…☆19Mar 15, 2020Updated 5 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Oct 21, 2025Updated 4 months ago
- A small rust-based data loader☆36Feb 20, 2026Updated last week
- Scripts for working with open.bible data☆26Jan 24, 2022Updated 4 years ago
- Fun pet project for creating Ukrainian-speaking Conversational AI☆20May 4, 2023Updated 2 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 3 years ago
- Text language identification using Wikipedia data☆31Aug 15, 2017Updated 8 years ago
- UCU Audio Processing Course☆39Feb 23, 2026Updated last week
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Jun 25, 2024Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆34Mar 31, 2023Updated 2 years ago
- Whisper fine-tuning event script to use multiple hf datasets☆32Dec 20, 2022Updated 3 years ago
- Golang bindings for Coqui's speech-to-text library☆34Aug 19, 2022Updated 3 years ago
- A python wrapper for REAPER☆81Jan 22, 2025Updated last year
- This is a telegram bot for correcting language mistakes in group chats☆10Jun 29, 2021Updated 4 years ago
- This is a fork of tortoise tts fast to easily create audio books locally on your computer☆12Apr 24, 2024Updated last year
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- ☆10Apr 24, 2024Updated last year
- Since August 2023 we have been improving Qaamuska iyo Erayada Afka-Soomaliga (Somali Dictionary and Vocabulary)☆18Oct 16, 2025Updated 4 months ago
- Training scripts for Speech-To-Text models for the Ukrainian language☆40Aug 28, 2023Updated 2 years ago
- ☆15Mar 15, 2022Updated 3 years ago
- Top 3% in Kaggle housing competition☆10Feb 6, 2021Updated 5 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆42Feb 4, 2026Updated last month
- Modular Synthesizer Studio☆37Feb 9, 2026Updated 3 weeks ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- Rababa, the diacritization library for Arabic and Hebrew (Abjad scripts in general)☆13May 1, 2025Updated 10 months ago