fido-ai / ua-datasetsView external linksLinks
A collection of datasets for Ukrainian language
☆57Oct 26, 2025Updated 3 months ago
Alternatives and similar repositories for ua-datasets
Users that are interested in ua-datasets are comparing it to the libraries listed below
Sorting:
- A corpus of Ukrainian Twitter texts + instructions for downloading and filtering texts.☆15Jul 4, 2019Updated 6 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Oct 21, 2025Updated 3 months ago
- UNLP 2025 Shared Task on Detecting Social Media Manipulation☆23Aug 4, 2025Updated 6 months ago
- Ukrainian instruction-tuned language models and datasets☆96Jul 12, 2024Updated last year
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 2 years ago
- ☆15Oct 29, 2024Updated last year
- The list of Ukrainian words for sentiment analysis and NLP☆15Sep 5, 2021Updated 4 years ago
- A collection of links to Ukrainian language tools☆39Apr 27, 2022Updated 3 years ago
- Experimental repository for NER (Named-entity recognition) for sentences of Ukrainian language.☆13Aug 13, 2021Updated 4 years ago
- the list of ~2000 ukrainian stopwords (with numbers)☆66May 20, 2021Updated 4 years ago
- Curated list of Ukrainian natural language processing (NLP) resources (corpora, pretrained models, libriaries, etc.)☆227Nov 3, 2025Updated 3 months ago
- GPT-2 Metadata Pretraining Towards Instruction Finetuning for Ukrainian☆20Aug 6, 2023Updated 2 years ago
- Dictionary of obscene words for Ukrainian language☆22May 15, 2025Updated 9 months ago
- Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression"☆25Jun 28, 2023Updated 2 years ago
- Фонограми та синтагми: інструменти обробки☆21Jun 21, 2025Updated 7 months ago
- Fun pet project for creating Ukrainian-speaking Conversational AI☆20May 4, 2023Updated 2 years ago
- Official repo for the paper "Make Some Noise: Reliable and Efficient Single-Step Adversarial Training" (https://arxiv.org/abs/2202.01181)☆25Oct 17, 2022Updated 3 years ago
- ☆27Jun 12, 2023Updated 2 years ago
- English to Ukrainian dictionary☆30Jul 12, 2023Updated 2 years ago
- UCU Audio Processing Course☆39Updated this week
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Modern partition manager for PostgreSQL☆17May 18, 2023Updated 2 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆38Jun 14, 2022Updated 3 years ago
- A simple yet powerful data validator for javascript.☆12Jan 7, 2023Updated 3 years ago
- Unity package for cutting the selected area of the mesh for HoloLens.☆10Sep 21, 2020Updated 5 years ago
- Training scripts for Speech-To-Text models for Ukrainian language☆40Aug 28, 2023Updated 2 years ago
- This is a telegram bot for correcting language mistakes in group chats☆10Jun 29, 2021Updated 4 years ago
- ☆11Dec 21, 2023Updated 2 years ago
- Online streaming speaker change detection model in Pytorch☆44Apr 14, 2023Updated 2 years ago
- Newsdata.io Official Python Client☆14Jan 14, 2026Updated last month
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Hackathon project for Snarky workshop.☆11Jun 21, 2019Updated 6 years ago
- Cross-platform gamepad library for nim☆12May 13, 2023Updated 2 years ago
- Magic Wormhole for Haskell☆11Apr 23, 2024Updated last year
- A simple GitHub search client built with Vue 3 and Apollo.☆12Mar 5, 2021Updated 4 years ago
- PHP app which allows you to get parts of a library as separate zips. Developed and tested with Zend Framework☆14Apr 15, 2010Updated 15 years ago
- Async web scraping framework on top of Rust. Works with Free-threaded Python (`PYTHON_GIL=0`).☆24Updated this week
- ☆10Mar 16, 2024Updated last year
- Do not use! Fuel's ORM now has built-in support for nested sets!☆17Apr 27, 2014Updated 11 years ago