proger / uk4b
GPT-2 Metadata Pretraining Towards Instruction Finetuning for Ukrainian
☆17 Updated last year
Related projects
Alternatives and complementary repositories for uk4b
- ☆23 Updated 2 years ago
- A corpus of Ukrainian Twitter texts + instructions for downloading and filtering texts. ☆15 Updated 5 years ago
- Agent toolkit for 100 hours of speech and 10 GiB of text ☆13 Updated 8 months ago
- Phonograms and syntagms: processing tools ☆21 Updated 9 months ago
- Dictionary of obscene words for Ukrainian language ☆17 Updated 3 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini ☆19 Updated last year
- ☆26 Updated last year
- A collection of links to Ukrainian language tools ☆30 Updated 2 years ago
- Ukrainian ELECTRA model ☆12 Updated last year
- Training BERT for punctuation task ☆10 Updated 4 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper. ☆15 Updated 3 years ago
- A collection of datasets for Ukrainian language ☆55 Updated 3 months ago
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition ☆17 Updated 3 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated 8 months ago
- Ukrainian instruction-tuned language models and datasets ☆84 Updated 3 months ago
- ☆13 Updated 3 years ago
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences ☆33 Updated 3 years ago
- ☆28 Updated 6 months ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 2 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆42 Updated 3 years ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms. ☆26 Updated 2 months ago
- ☆19 Updated 5 years ago
- Training scripts for Speech-To-Text models for Ukrainian language ☆34 Updated last year
- Smart Language Model ☆47 Updated last year
- ☆56 Updated last year
- Library for fast text representation and classification. ☆28 Updated 10 months ago
- phone inventory library ☆15 Updated last year
- ☆11 Updated 3 years ago
- Convert words to numbers ☆20 Updated 2 years ago
- T5-based (russian) text normalization ☆19 Updated 9 months ago