Zero-shot Transfer Learning from English to Arabic
☆30 · Jun 22, 2022 · Updated 3 years ago
Alternatives and similar repositories for GigaBERT
Users that are interested in GigaBERT are comparing it to the libraries listed below.
- Arabic - English emotion lexicon ☆12 · Apr 24, 2017 · Updated 8 years ago
- Nile University's Arabic sentiment Lexicon ☆17 · Nov 24, 2016 · Updated 9 years ago
- This repository contains the Arabic sarcasm dataset (ArSarcasm) ☆26 · Feb 18, 2021 · Updated 5 years ago
- Arabic edition of BERT pretrained language models ☆133 · Dec 5, 2020 · Updated 5 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys… ☆12 · Jan 26, 2022 · Updated 4 years ago
- UDPipe based preprocessing of the ACE05 dataset ☆18 · Jun 7, 2020 · Updated 5 years ago
- Arabic News Stance Corpus ☆11 · Feb 5, 2021 · Updated 5 years ago
- A Python implementation of Farasa toolkit ☆139 · Sep 11, 2025 · Updated 6 months ago
- Arabic edition of ALBERT pretrained language models ☆16 · Apr 25, 2021 · Updated 4 years ago
- Code repository for our paper, "Medical Large Language Models are Vulnerable to Data Poisoning Attacks" (Nature Medicine, 2024). ☆12 · Jan 5, 2025 · Updated last year
- Official FIRE 2020 Authorship Identification of SOurce COde (AI-SOCO) task repository containing dataset, evaluation tools and baselines ☆19 · May 22, 2023 · Updated 2 years ago
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs ☆11 · Jan 13, 2023 · Updated 3 years ago
- Poetry Corpora Annotated on Aesthetic Emotions ☆12 · Aug 2, 2022 · Updated 3 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so… ☆17 · Jul 27, 2023 · Updated 2 years ago
- ☆17 · Dec 12, 2024 · Updated last year
- Pre-trained Transformers for Arabic Language Understanding and Generation (Arabic BERT, Arabic GPT2, Arabic ELECTRA) ☆712 · Oct 17, 2022 · Updated 3 years ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams. ☆12 · Feb 5, 2020 · Updated 6 years ago
- Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits ☆23 · Sep 23, 2017 · Updated 8 years ago
- Examples and templates of aws automation with terraform ☆13 · May 13, 2023 · Updated 2 years ago
- TEAD : Large Scale Arabic Dataset for Sentiment Analysis ☆12 · Oct 16, 2018 · Updated 7 years ago
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer" ☆17 · Nov 22, 2021 · Updated 4 years ago
- Arabic Stop Word List ☆36 · Jan 11, 2024 · Updated 2 years ago
- Generating Annotation Spreadsheet for QA-SRL Scheme ☆12 · Feb 14, 2017 · Updated 9 years ago
- A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi. ☆540 · Mar 5, 2026 · Updated 2 weeks ago
- Largest list of Arabic stop words on Github. أكبر قائمة لمستبعدات الفهرسة العربية على جيت هاب ☆329 · Mar 27, 2024 · Updated last year
- Code for paper https://arxiv.org/abs/2501.00522 ☆14 · Apr 28, 2025 · Updated 10 months ago
- ☆13 · May 26, 2021 · Updated 4 years ago
- Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction" ☆10 · Feb 17, 2021 · Updated 5 years ago
- Material for the Text Analysis of Arabic course taught at the NYU Abu Dhabi Winter Institute in Digital Humanities 2020. ☆15 · Jan 30, 2020 · Updated 6 years ago
- Python library for backtranslation (with Google Translate) ☆12 · Jan 11, 2020 · Updated 6 years ago
- Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network ☆20 · Jul 26, 2021 · Updated 4 years ago
- Scripts for WASSA-2017 Shared Task on Emotion Intensity ☆14 · Oct 4, 2017 · Updated 8 years ago
- Named Entity Recognition System for Arabic ☆20 · Nov 29, 2022 · Updated 3 years ago
- A webhook that integrates the W&B model registry with Modal Labs ☆15 · Dec 24, 2023 · Updated 2 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 · Jul 28, 2022 · Updated 3 years ago
- COVID-19 Infodemic Twitter dataset ☆13 · Sep 5, 2021 · Updated 4 years ago
- ☆17 · Aug 27, 2018 · Updated 7 years ago
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval ☆26 · Aug 7, 2023 · Updated 2 years ago
- All resources created and used in Arabic Sentiment Analysis of Arabic Tweets. Includes Sentiment lexicon generated from Arabic tweets and… ☆14 · Dec 21, 2021 · Updated 4 years ago