semantic-systems / amharic-qa
AmQA - The first Amharic Open Domain Question Answering Dataset
☆12Updated last year
Alternatives and similar repositories for amharic-qa
Users who are interested in amharic-qa are comparing it to the repositories listed below.
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆94Updated 2 years ago
- PyTorch – SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models.☆62Updated 3 years ago
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.☆74Updated last year
- This repository hosts my experiments for the project I did with OffNote Labs.☆10Updated 4 years ago
- MAFAND-MT☆57Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆97Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆73Updated last year
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 4 years ago
- ☆22Updated 3 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆155Updated last year
- ☆66Updated 2 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling.☆60Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Updated 3 years ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆114Updated 2 years ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆29Updated 2 years ago
- Multi-task modelling extensions for huggingface transformers☆21Updated 2 years ago
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness☆102Updated 7 months ago
- 🌏 Modular retrievers for zero-shot multilingual IR.☆28Updated last year
- TimeLMs: Diachronic Language Models from Twitter☆110Updated last year
- ☆75Updated 4 years ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆49Updated 2 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆137Updated 2 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆132Updated last year
- This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 4…☆274Updated last year
- ☆44Updated 4 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆103Updated last year
- Using a business-level retrieval system (BM25) with Python in just a few lines.☆31Updated 2 years ago
- Bi-encoder entity linking architecture☆50Updated 11 months ago
- Code for the CRAC 2021 paper "On Generalization in Coreference Resolution" (Best short paper award)☆35Updated 2 years ago
- Tools for managing datasets for governance and training.☆83Updated 3 weeks ago