UBC-NLP / aoc_idLinks
Arabic Dialect Identification on AOC data.
☆24Updated 6 years ago
Alternatives and similar repositories for aoc_id
Users that are interested in aoc_id are comparing it to the libraries listed below
Sorting:
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆24Updated 4 years ago
- Zero-shot Transfer Learning from English to Arabic☆30Updated 3 years ago
- Arabic edition of BERT pretrained language models☆132Updated 5 years ago
- Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP.☆54Updated last year
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic☆112Updated 4 years ago
- hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic☆47Updated 5 years ago
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation.☆11Updated 3 years ago
- ☆15Updated 4 years ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆35Updated 3 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆11Updated 3 years ago
- ☆45Updated 3 years ago
- Arabic NER system with a strong performance☆36Updated 5 years ago
- A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be…☆26Updated 11 years ago
- ☆43Updated 10 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated 2 years ago
- Arabic edition of ALBERT pretrained language models☆16Updated 4 years ago
- Pre-process arabic text (remove diacritics, punctuations and repeating characters)☆107Updated 8 years ago
- Curated list of publicly available parallel corpus for Indian Languages☆36Updated 4 years ago
- Code for extracting parallel corpora from pmindia☆16Updated 5 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 3 years ago
- Arabic Language Model based on Bert☆19Updated 5 years ago
- Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…☆13Updated 4 years ago
- XAI Tutorial for the Explainable AI track in the ALPS winter school 2021☆58Updated 4 years ago
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆14Updated 3 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- This is a repository of the Multi-dialect Arabic BERT model.☆38Updated 5 years ago
- Appraise code used as part of WMT21 human evaluation campaign☆29Updated this week
- ☆55Updated 3 years ago
- This repo contains a set of neural transducer, e.g. sequence-to-sequence model, focusing on character-level tasks.☆76Updated 2 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆110Updated 2 years ago