UBC-NLP / aoc_idLinks
Arabic Dialect Identification on AOC data.
☆24Updated 6 years ago
Alternatives and similar repositories for aoc_id
Users that are interested in aoc_id are comparing it to the libraries listed below
Sorting:
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆24Updated 4 years ago
- ☆14Updated 4 years ago
- Zero-shot Transfer Learning from English to Arabic☆30Updated 3 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated 2 years ago
- ☆45Updated 3 years ago
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆14Updated 3 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Updated 2 years ago
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation.☆11Updated 2 years ago
- hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic☆47Updated 4 years ago
- Code for extracting parallel corpora from pmindia☆16Updated 5 years ago
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic☆110Updated 3 years ago
- ☆110Updated last year
- Arabic edition of BERT pretrained language models☆130Updated 4 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆11Updated 3 years ago
- Appraise code used as part of WMT21 human evaluation campaign☆28Updated 2 weeks ago
- Codebase for probing and visualizing multilingual models.☆49Updated 5 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆32Updated 5 months ago
- Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP.☆50Updated last year
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- ☆74Updated last week
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆66Updated 3 weeks ago
- This is a repository of the Multi-dialect Arabic BERT model.☆38Updated 5 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆101Updated last year
- Improving Low-Resource Neural Machine Translation of Related Languages by Transfer Learning☆18Updated 2 years ago
- Arabic NER system with a strong performance☆36Updated 5 years ago
- GlossBERT: BERT for Word Sense Disambiguation with Gloss Knowledge (EMNLP 2019)☆96Updated 2 years ago
- NTREX -- News Test References for MT Evaluation☆85Updated last year
- ☆54Updated 3 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated 2 years ago