semantic-systems / amharic-qaLinks
AmQA - The first Amharic Open Domain Question Answering Dataset
☆12Updated last year
Alternatives and similar repositories for amharic-qa
Users that are interested in amharic-qa are comparing it to the libraries listed below
Sorting:
- MAFAND-MT☆57Updated last year
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆74Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆114Updated 2 years ago
- Multi-task modelling extensions for huggingface transformers☆20Updated 2 years ago
- Long-context pretrained encoder-decoder models☆96Updated 2 years ago
- Bi-encoder entity linking architecture☆47Updated 11 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Updated 2 years ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆201Updated last year
- ☆86Updated 4 months ago
- Multilingual Generative Pretrained Model☆205Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆132Updated last year
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Updated 3 years ago
- Do Multilingual Language Models Think Better in English?☆42Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆105Updated 3 years ago
- DialogSum: A Real-life Scenario Dialogue Summarization Dataset - Findings of ACL 2021☆180Updated 8 months ago
- Using business-level retrieval system (BM25) with Python in just a few lines.☆31Updated 2 years ago
- An official repository for MIA 2022 (NAACL 2022 Workshop) Shared Task on Cross-lingual Open-Retrieval Question Answering.☆31Updated 3 years ago
- TimeLMs: Diachronic Language Models from Twitter☆109Updated last year
- ☆78Updated last year
- An instruction-based benchmark for text improvements.☆141Updated 2 years ago
- (NAACL 2024) Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations☆13Updated 3 months ago
- ☆11Updated last year
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆103Updated last year
- ☆101Updated 2 years ago
- Observe the slow deterioration of my mental sanity in the github commit history☆12Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆78Updated 3 years ago
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization.☆34Updated 3 years ago
- This dataset contains human judgements about answer equivalence. The data is based on SQuAD (Stanford Question Answering Dataset), and co…☆26Updated 2 years ago
- A question-answering dataset with a focus on subjective information☆45Updated last year