A corpus of Ukrainian Twitter texts, with instructions for downloading and filtering them.
☆15, updated Jul 4, 2019
Alternatives and similar repositories for ukr-twi-corpus
Users interested in ukr-twi-corpus also compare it with the repositories listed below.
- Dictionary of obscene words for Ukrainian language (☆22, updated May 15, 2025)
- UNLP 2024 Shared Task on LLM instruction-tuning for Ukrainian (☆17, updated Apr 15, 2024)
- ☆27, updated Jun 12, 2023
- GPT-2 Metadata Pretraining Towards Instruction Finetuning for Ukrainian (☆20, updated Aug 6, 2023)
- A collection of datasets for Ukrainian language (☆57, updated Oct 26, 2025)
- Data from "Crowdsourcing of Parallel Corpora: the Case of Style Transfer for Detoxification" paper (☆14, updated Apr 3, 2025)
- ☆23, updated Jan 21, 2022
- Home of Projector's "Data Science. Natural Language Processing" 2020 Edition (☆19, updated Oct 3, 2023)
- Code for the paper "Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression" (☆25, updated Jun 28, 2023)
- Adds word stress to Ukrainian texts (☆59, updated Sep 29, 2024)
- ☆28, updated May 7, 2015
- HR Analytics Dataset (☆10, updated Mar 29, 2019)
- ☆29, updated Nov 12, 2025
- ☆15, updated Nov 16, 2015
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models" (☆11, updated Apr 15, 2024)
- Visualizing the activity of many concurrent processes (☆59, updated Jul 7, 2020)
- Modern partition manager for PostgreSQL (☆17, updated May 18, 2023)
- Streamlit deployment on AWS Fargate (☆12, updated Jul 1, 2020)
- An elaborate approach for ABC-XYZ Analysis (☆11, updated May 10, 2020)
- Towards Understanding Sharpness-Aware Minimization [ICML 2022] (☆38, updated Jun 14, 2022)
- Wavenet conditioned on midi for music synthesis (☆37, updated Jun 6, 2019)
- UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language (☆269, updated Feb 11, 2024)
- A formalization of the Dedekind real numbers in Coq [maintainer=@andrejbauer] (☆45, updated Jul 14, 2024)
- Example how to append data to a Haskell executable using sqlite (☆10, updated Mar 16, 2020)
- ☆11, updated Feb 11, 2020
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in … (☆11, updated Mar 29, 2021)
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and… (☆11, updated Jun 28, 2021)
- Datasets for compositional learning (☆11, updated Nov 28, 2018)
- ☆11, updated Oct 19, 2024
- Hackathon project for Snarky workshop. (☆11, updated Jun 21, 2019)
- ☆26, updated Sep 3, 2025
- XML Type for Yjs (☆12, updated Oct 2, 2017)
- ☆10, updated Mar 11, 2024
- Ukrainian ELECTRA model (☆12, updated Mar 11, 2023)
- An attempt to formalize unix cat in fiat (☆11, updated May 28, 2017)
- The source code and the data for ACL 2022 paper "Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Dat… (☆14, updated Apr 21, 2023)
- ☆10, updated Mar 16, 2024
- The official Languini Kitchen repository (☆14, updated May 6, 2024)
- A Visualizer for prosodically annotated speech corpora (☆12, updated Oct 27, 2021)