semantic-systems / amharic-qa
AmQA - The first Amharic Open Domain Question Answering Dataset
☆11Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for amharic-qa
- Different semantic models for Amharic☆17Updated 10 months ago
- notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification datas…☆9Updated 6 months ago
- A library of translation-based text similarity measures☆25Updated 11 months ago
- ☆11Updated 4 months ago
- ☆16Updated last year
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆31Updated 2 years ago
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆11Updated 3 years ago
- GupShup: Summarizing Open-Domain Code-Switched Conversations EMNLP 2021☆15Updated 3 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆70Updated 8 months ago
- The large-scale MultiLingual SUMmarization corpus☆26Updated 2 years ago
- ☆37Updated last year
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14Updated 4 years ago
- ☆23Updated 2 years ago
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆12Updated 2 years ago
- DiscoScore: Evaluating Text Generation with BERT and Discourse Coherence☆35Updated last year
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆30Updated last year
- ☆26Updated 10 months ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Updated last year
- ☆97Updated 2 years ago
- This repository contains the PyTorch implementation of the paper STaCK: Sentence Ordering with Temporal Commonsense Knowledge appearing a…☆28Updated last year
- PyTorch reimplementation of REALM and ORQA☆22Updated 2 years ago
- ☆20Updated 3 years ago
- ☆24Updated 5 months ago
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆40Updated 2 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆32Updated last year
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 2 years ago
- A question-answering dataset with a focus on subjective information☆43Updated 10 months ago