GPT-2 Metadata Pretraining Towards Instruction Finetuning for Ukrainian
☆20Aug 6, 2023Updated 2 years ago
Alternatives and similar repositories for uk4b
Users who are interested in uk4b are comparing it to the libraries listed below
- Phonograms and syntagms: processing tools☆21Jun 21, 2025Updated 8 months ago
- Agent toolkit for 100 hours of speech and 10 GiB of text☆14Jul 15, 2025Updated 7 months ago
- A corpus of Ukrainian Twitter texts + instructions for downloading and filtering texts.☆15Jul 4, 2019Updated 6 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Oct 21, 2025Updated 4 months ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 2 years ago
- ☆15Oct 29, 2024Updated last year
- ☆27Jun 12, 2023Updated 2 years ago
- PathPiece tokenizer☆13Nov 10, 2024Updated last year
- UNLP 2024 Shared Task on LLM instruction-tuning for Ukrainian☆17Apr 15, 2024Updated last year
- Curated list of Ukrainian natural language processing (NLP) resources (corpora, pretrained models, libraries, etc.)☆227Feb 19, 2026Updated last week
- Speech in Flax/JAX☆15Jul 11, 2022Updated 3 years ago
- PyTorch Language Modeling Toolkit for Fast Weight Programmers☆19Jun 11, 2025Updated 8 months ago
- ☆23Jan 21, 2022Updated 4 years ago
- Dictionary of obscene words for Ukrainian language☆22May 15, 2025Updated 9 months ago
- Brown Corpus of the Ukrainian Language☆118Feb 22, 2026Updated last week
- Haskell + nixpkgs = nix-hs☆24Jun 2, 2021Updated 4 years ago
- UNLP 2025 Shared Task on Detecting Social Media Manipulation☆23Aug 4, 2025Updated 6 months ago
- Fun pet project for creating Ukrainian-speaking Conversational AI☆20May 4, 2023Updated 2 years ago
- Ukrainian instruction-tuned language models and datasets☆96Jul 12, 2024Updated last year
- Staged Training for Transformer Language Models☆33Mar 31, 2022Updated 3 years ago
- LTG-Bert☆34Jan 8, 2024Updated 2 years ago
- Invertible parsing for S-expressions☆34Feb 4, 2026Updated 3 weeks ago
- Text language identification using Wikipedia data☆31Aug 15, 2017Updated 8 years ago
- Viterbi decoding in PyTorch☆40Sep 10, 2025Updated 5 months ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆34Aug 6, 2023Updated 2 years ago
- A collection of links to Ukrainian language tools☆39Apr 27, 2022Updated 3 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆33Apr 24, 2020Updated 5 years ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆39Jun 11, 2025Updated 8 months ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Jan 9, 2025Updated last year
- Structured Prediction for Entity Linking☆38Aug 2, 2024Updated last year
- A JAX library for building lattice-based speech transducer models☆46Updated this week
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- Training scripts for Speech-To-Text models for Ukrainian language☆40Aug 28, 2023Updated 2 years ago
- Machine learning for control systems engineers☆11Jul 19, 2023Updated 2 years ago
- This is a Telegram bot for correcting language mistakes in group chats☆10Jun 29, 2021Updated 4 years ago
- ☆15Mar 15, 2022Updated 3 years ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆47Feb 19, 2025Updated last year
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- ☆10Oct 2, 2024Updated last year