instadeepai / tunbert
TunBERT is the first release of a pre-trained BERT model for the Tunisian dialect, trained on a Tunisian Common-Crawl-based dataset. TunBERT was applied to three downstream NLP tasks: Sentiment Analysis (SA), Tunisian Dialect Identification (TDI), and Reading Comprehension Question-Answering (RCQA).
☆117 Updated 2 years ago
Alternatives and similar repositories for tunbert
Users who are interested in tunbert are comparing it to the libraries listed below.
- Tunisian Sentiment Analysis Corpus. ☆27 Updated 4 years ago
- AraT5: Text-to-Text Transformers for Arabic Language Understanding ☆90 Updated last year
- Sentiment Analysis in Arabic tweets ☆74 Updated 5 years ago
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic ☆108 Updated 3 years ago
- DziriBERT: a Pre-trained Language Model for the Algerian Dialect ☆163 Updated 2 years ago
- ☆72 Updated last year
- Arabic Open Domain Question Answering System using Neural Reading Comprehension ☆165 Updated last year
- Arabic edition of BERT pretrained language models ☆129 Updated 4 years ago
- مستودع الأوراق المسحية في معالجة اللغة العربية (أسبر) A Repository for survey and review papers in Arabic Natural Language processing (AN… ☆80 Updated 3 weeks ago
- The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes. ☆172 Updated last week
- Arabic Tokenization Library. It provides many tokenization algorithms. ☆106 Updated last year
- ☆38 Updated 2 years ago
- A Python implementation of Farasa toolkit ☆129 Updated 2 weeks ago
- ☆30 Updated 5 years ago
- Generating Arabic poetry using Markov chains. ☆111 Updated 3 years ago
- Neural Arabic text diacritization ☆91 Updated 2 years ago
- This repo contains Arabic OCR App ☆60 Updated 2 years ago
- A tiny wrapper for Arabic WordCloud plots ☆10 Updated 5 years ago
- A curated collection of resources and repositories for Natural Language Processing (NLP) tasks specific to Darija, the Moroccan Arabic di… ☆83 Updated last year
- Arabic cleaning, normalization and segmentation library. ☆70 Updated last year
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA). ☆53 Updated 2 years ago
- A morphosyntactic analyzer for the Arabic language. ☆23 Updated 5 years ago
- ☆40 Updated 3 years ago
- Python library used for Arabic NLP to process, prepare and clean the Arabic text ☆16 Updated 11 months ago
- Arabic NLP tools List inventory ☆87 Updated 2 years ago
- ☆50 Updated 2 years ago
- Maha is a text processing library specially developed to deal with Arabic text. ☆208 Updated last month
- Pre-process Arabic text (remove diacritics, punctuation and repeating characters) ☆106 Updated 8 years ago
- GloVe model for distributed Arabic word representation ☆34 Updated 2 years ago
- A deep learning model to classify the Arabic letters and digits easily. ☆64 Updated 4 years ago