eligugliotta / tarcLinks
Tunisian Arabish Corpus
☆11Updated last year
Alternatives and similar repositories for tarc
Users that are interested in tarc are comparing it to the libraries listed below
Sorting:
- MAFAND-MT☆60Updated last year
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆63Updated last year
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- Arabic cleaning, normalization and segmentation library.☆72Updated 2 years ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆57Updated 2 years ago
- ☆17Updated 3 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆35Updated 10 months ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆80Updated 3 years ago
- Crosslingual Question Answering for African Languages☆30Updated last year
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆31Updated 4 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆56Updated 3 months ago
- NTREX -- News Test References for MT Evaluation☆86Updated last year
- ☆127Updated last year
- Arabic Tokenization Library. It provides many tokenization algorithms.☆110Updated 2 years ago
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREME☆105Updated 9 months ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆43Updated 9 months ago
- مستودع الأوراق المسحية في معالجة اللغة العربية (أسبر) A Repository for survey and review papers in Arabic Natural Language processing (AN…☆84Updated last month
- ☆22Updated 3 years ago
- ☆41Updated 3 years ago
- ParaNames: A multilingual resource for parallel names☆39Updated last year
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆21Updated last year
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic☆113Updated 4 years ago
- Several deep learning models for restoring Arabic diacritics using Pytorch.☆36Updated 3 years ago
- Seq2Seq-based open domain empathetic conversational model for Arabic: Dataset & Model☆59Updated 10 months ago
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆90Updated 6 months ago
- A comprehensive list of Arabic NLP resources.☆43Updated 4 months ago
- zero shot NER fine tuning☆13Updated 9 months ago
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆25Updated 3 years ago
- MasakhaNEWS: News Topic Classification for African Languages☆24Updated last year
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆87Updated last year