Zero-shot Transfer Learning from English to Arabic
☆30Jun 22, 2022Updated 4 years ago
Alternatives and similar repositories for GigaBERT
Users that are interested in GigaBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆29Feb 18, 2021Updated 5 years ago
- Arabic edition of BERT pretrained language models☆133Dec 5, 2020Updated 5 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆12Jan 26, 2022Updated 4 years ago
- Code and data for Veridicality classifier on Twitter☆11May 23, 2018Updated 8 years ago
- ☆15Jun 8, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- GPT-jax based on the official huggingface library☆13Jun 22, 2021Updated 5 years ago
- A Python implementation of Farasa toolkit☆142Sep 11, 2025Updated 9 months ago
- Arabic edition of ALBERT pretrained language models☆15Apr 25, 2021Updated 5 years ago
- litrl browser and detectors☆10Oct 5, 2023Updated 2 years ago
- Code repository for our paper, "Medical Large Language Models are Vulnerable to Data Poisoning Attacks" (Nature Medicine, 2024).☆13Jan 5, 2025Updated last year
- distilled Self-Critique refines the outputs of a LLM with only synthetic data☆11Apr 11, 2024Updated 2 years ago
- Official FIRE 2020 Authorship Identification of SOurce COde (AI-SOCO) task repository containing dataset, evaluation tools and baselines☆19May 22, 2023Updated 3 years ago
- ☆17Dec 12, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs☆11Jan 13, 2023Updated 3 years ago
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so…☆17Jul 27, 2023Updated 2 years ago
- Pre-trained Transformers for Arabic Language Understanding and Generation (Arabic BERT, Arabic GPT2, Arabic ELECTRA)☆727Oct 17, 2022Updated 3 years ago
- ☆15Sep 27, 2022Updated 3 years ago
- Can Large Language Models Identify Authorship? (EMNLP 2024 Findings)☆13Feb 4, 2025Updated last year
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆12Feb 5, 2020Updated 6 years ago
- Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits☆23Sep 23, 2017Updated 8 years ago
- Poetry Corpora Annotated on Aesthetic Emotions☆13Aug 2, 2022Updated 3 years ago
- Examples and templates of aws automation with terraform☆13May 13, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- TEAD : Large Scale Arabic Dataset for Sentiment Analysis☆12Oct 16, 2018Updated 7 years ago
- Code for the paper "Modelling Latent Translations for Cross-Lingual Transfer"☆17Nov 22, 2021Updated 4 years ago
- Arabic Stop Word List☆38Jan 11, 2024Updated 2 years ago
- Generating Annotation Spreadsheet for QA-SRL Scheme☆12Feb 14, 2017Updated 9 years ago
- A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.☆560Jun 8, 2026Updated 3 weeks ago
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic☆117Sep 2, 2021Updated 4 years ago
- This is an official repository for "Artificial Text Detection via Examining the Topology of Attention Maps" presented at EMNLP 2021 confe…☆24Sep 5, 2023Updated 2 years ago
- Code for paper https://arxiv.org/abs/2501.00522☆15Apr 28, 2025Updated last year
- ☆13May 26, 2021Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Python package to deal with PAN corpora and extract stylometric features from text documents.☆15Nov 11, 2022Updated 3 years ago
- Python library for backtranslation (with Google Translate)☆12Jan 11, 2020Updated 6 years ago
- Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network☆20Jul 26, 2021Updated 4 years ago
- Scripts for WASSA-2017 Shared Task on Emotion Intensity☆14Oct 4, 2017Updated 8 years ago
- Named Entity Recognition System for Arabic☆21Nov 29, 2022Updated 3 years ago
- ☆15Jul 29, 2024Updated last year
- This repo supports various cross-lingual transfer learning & multilingual NLP models.☆92Sep 13, 2023Updated 2 years ago