IS2AI / KazQADLinks
An open-source Kazakh Question Answering Dataset
☆8Updated 9 months ago
Alternatives and similar repositories for KazQAD
Users that are interested in KazQAD are comparing it to the libraries listed below
Sorting:
- An open-source Kazakh named entity recognition dataset (KazNERD), annotation guidelines, and baseline NER models.☆30Updated 5 months ago
- NLP tools for Kazakh language☆33Updated 3 years ago
- NEREL: A Russian Dataset with Nested Named Entities, Relations and Events☆31Updated last year
- NLP tools for Kazakh language☆46Updated 4 years ago
- Материалы курса "Компьютерная лингвистика и информационные технологии" для 4-го курса бак алавриата направления "Фундаментальная и приклад…☆9Updated 4 years ago
- Datasets for the task of tracing diachronic semantic shifts in Russian for two large-scale time period pairs (from pre-Soviet to Soviet t…☆14Updated 4 months ago
- Repository for "BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation", accepted at EAMT 2…☆19Updated last year
- Neural CRF Model for Sentence Alignment in Text Simplification☆68Updated 5 months ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 2 years ago
- материалы курса по питону для студентов дпо-программы "компьютерная лингвистика" в НИУ ВШЭ (2020-2021)☆11Updated 3 years ago
- SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages☆9Updated last year
- Improved Sentence Alignment in Linear Time and Space☆174Updated 2 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆155Updated last month
- OpusFilter - Parallel corpus processing toolkit☆104Updated this week
- Efficient Low-Memory Aligner☆145Updated 5 months ago
- A neural word aligner based on multilingual BERT☆350Updated 3 years ago
- ☆25Updated 3 months ago
- A Parallel Russian-Simple Russian Dataset☆10Updated 2 years ago
- Repository for DISRPT2023 shared task☆17Updated 11 months ago
- NLP course @ CS Faculty, HSE☆15Updated 5 years ago
- Repository to collect and categorize Grammatical Error Correction papers.☆119Updated 2 months ago
- Convert Standard M2 format to parallel sentences.☆22Updated 5 years ago
- a tool for calcualting character n-gram F score☆73Updated 2 years ago
- How to finetune mbart using fairseq☆24Updated 4 years ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆26Updated 2 years ago
- cLang-8 is a dataset for grammatical error correction.☆106Updated 2 years ago
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".☆35Updated 3 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- Morphological Inflection for Low-Resource Languages using cross-lingual transfer☆20Updated 5 years ago
- Alignment and annotation for comparable documents.☆22Updated 6 years ago