Zero-shot Transfer Learning from English to Arabic
☆30Jun 22, 2022Updated 3 years ago
Alternatives and similar repositories for GigaBERT
Users that are interested in GigaBERT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Arabic - English emotion lexicon☆12Apr 24, 2017Updated 9 years ago
- This repository contains the Arabic sarcasm dataset (ArSarcasm)☆29Feb 18, 2021Updated 5 years ago
- Arabic edition of BERT pretrained language models☆133Dec 5, 2020Updated 5 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆12Jan 26, 2022Updated 4 years ago
- Code and data for Veridicality classifier on Twitter☆11May 23, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Jun 8, 2021Updated 5 years ago
- UDPipe based preprocessing of the ACE05 dataset☆18Jun 7, 2020Updated 6 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- GPT-jax based on the official huggingface library☆13Jun 22, 2021Updated 4 years ago
- EMNLP 2022 Demo "SynKB: Semantic Search for Chemical Synthesis Procedures"☆17Oct 31, 2022Updated 3 years ago
- A Python implementation of Farasa toolkit☆141Sep 11, 2025Updated 9 months ago
- Arabic edition of ALBERT pretrained language models☆16Apr 25, 2021Updated 5 years ago
- Arabic NER system with a strong performance☆36Mar 12, 2020Updated 6 years ago
- litrl browser and detectors☆10Oct 5, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code repository for our paper, "Medical Large Language Models are Vulnerable to Data Poisoning Attacks" (Nature Medicine, 2024).☆13Jan 5, 2025Updated last year
- Accessible Technology Anywhere☆19Oct 8, 2015Updated 10 years ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- distilled Self-Critique refines the outputs of a LLM with only synthetic data☆11Apr 11, 2024Updated 2 years ago
- Repository for NLP project. Name to be changed when we decide on a project☆16Apr 19, 2022Updated 4 years ago
- Official FIRE 2020 Authorship Identification of SOurce COde (AI-SOCO) task repository containing dataset, evaluation tools and baselines☆19May 22, 2023Updated 3 years ago
- My solution in Zindi Tunisian Sentiment Analysis competition. Ranked #1st.☆12Jun 8, 2021Updated 5 years ago
- ☆17Dec 12, 2024Updated last year
- C4RepSet: Representative Subset from C4 data for Training Pre-trained LMs☆11Jan 13, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so…☆17Jul 27, 2023Updated 2 years ago
- Pre-trained Transformers for Arabic Language Understanding and Generation (Arabic BERT, Arabic GPT2, Arabic ELECTRA)☆723Oct 17, 2022Updated 3 years ago
- Can Large Language Models Identify Authorship? (EMNLP 2024 Findings)☆13Feb 4, 2025Updated last year
- Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits☆23Sep 23, 2017Updated 8 years ago
- Poetry Corpora Annotated on Aesthetic Emotions☆13Aug 2, 2022Updated 3 years ago
- Examples and templates of aws automation with terraform☆13May 13, 2023Updated 3 years ago
- TEAD : Large Scale Arabic Dataset for Sentiment Analysis☆12Oct 16, 2018Updated 7 years ago
- Arabic Stop Word List☆38Jan 11, 2024Updated 2 years ago
- Generating Annotation Spreadsheet for QA-SRL Scheme☆12Feb 14, 2017Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.☆554Jun 5, 2026Updated last week
- Largest list of Arabic stop words on Github. أكبر قائمة لمستبعدات الفهرسة العربية على جيت هاب☆333Mar 27, 2024Updated 2 years ago
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic☆117Sep 2, 2021Updated 4 years ago
- Python package to deal with PAN corpora and extract stylometric features from text documents.☆15Nov 11, 2022Updated 3 years ago
- A very basic Arabic OCR based on tesseract OCR engine written in Java.☆21Sep 21, 2015Updated 10 years ago
- Python library for backtranslation (with Google Translate)☆12Jan 11, 2020Updated 6 years ago
- Scripts for WASSA-2017 Shared Task on Emotion Intensity☆14Oct 4, 2017Updated 8 years ago