lang-uk / dragoman
☆13 Updated 5 months ago
Alternatives and similar repositories for dragoman:
Users who are interested in dragoman are comparing it to the libraries listed below.
- ☆23 Updated 3 years ago
- Dictionary of obscene words for the Ukrainian language ☆18 Updated 3 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini ☆19 Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- Training scripts for Speech-To-Text models for the Ukrainian language ☆35 Updated last year
- ☆13 Updated 3 years ago
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea… ☆15 Updated last year
- (Re)implementation of Learning Multi-level Dependencies for Robust Word Recognition ☆17 Updated 8 months ago
- ☆26 Updated last year
- GPT-2 Metadata Pretraining Towards Instruction Finetuning for Ukrainian ☆19 Updated last year
- Lightweight knowledge distillation pipeline ☆28 Updated 3 years ago
- Phonograms and syntagms: processing tools ☆21 Updated 3 months ago
- Russian-English GAN-based vocoder ☆17 Updated 3 years ago
- This repository provides data and code for the "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper. ☆15 Updated 3 years ago
- Python package to generate IPA (International Phonetic Alphabet) for Ukrainian words ☆9 Updated 11 months ago
- Fun pet project for creating Ukrainian-speaking Conversational AI ☆19 Updated last year
- Speech analytics package for call centers ☆23 Updated 4 years ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms. ☆28 Updated 6 months ago
- ☆20 Updated 5 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need? ☆11 Updated 3 weeks ago
- Adds word stress to Ukrainian texts ☆50 Updated 6 months ago
- Training BERT for punctuation task ☆10 Updated 4 years ago
- Speech in Flax/JAX ☆15 Updated 2 years ago
- ☆56 Updated 2 years ago
- Home of Projector's "Data Science. Natural Language Processing" 2020 Edition ☆19 Updated last year
- T5-based (Russian) text normalization ☆20 Updated last year
- Agent toolkit for 100 hours of speech and 10 GiB of text ☆13 Updated last year
- A repository for trainable multi-speaker TTS ☆14 Updated 3 years ago
- Phone inventory library ☆16 Updated last year
- Train punctuation and capitalization models for different languages ☆24 Updated 2 years ago