h9-tect / llama2-qlora-finetunined-Arabic
☆10 Updated last year
Alternatives and similar repositories for llama2-qlora-finetunined-Arabic
Users who are interested in llama2-qlora-finetunined-Arabic are comparing it to the repositories listed below.
- ☆124 Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆97 Updated last year
- ☆27 Updated 10 months ago
- Arabic poetry analysis and generation. ☆22 Updated last year
- Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023. ☆16 Updated 10 months ago
- Our submission for quran QA shared-task. Fortunately, this work achieved the first place among accepted papers. ☆19 Updated 6 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆70 Updated last year
- Arabic Tokenization Library. It provides many tokenization algorithms. ☆107 Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation ☆83 Updated last week
- This repository is the implementation of a Transformer model called MarianCG which is developed for the Code Generation problem. ☆21 Updated 2 years ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling ☆20 Updated 11 months ago
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabic ☆109 Updated 3 years ago
- Resources for cultural NLP research ☆98 Updated 2 months ago
- A Multilingual Replicable Instruction-Following Model ☆94 Updated 2 years ago
- Arabic cleaning, normalization and segmentation library. ☆70 Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ☆94 Updated last year
- Multilingual Large Language Models Evaluation Benchmark ☆127 Updated 10 months ago
- Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP. ☆48 Updated last year
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation. ☆10 Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023 ☆103 Updated last year
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions. ☆40 Updated 3 months ago
- Benchmarking Large Language Models ☆99 Updated 3 weeks ago
- Seq2Seq-based open domain empathetic conversational model for Arabic: Dataset & Model ☆58 Updated 4 months ago
- ☆13 Updated last week
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir… ☆42 Updated 8 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆131 Updated last year
- Synthetic Data Generation for Evaluation ☆15 Updated 4 months ago
- This repository contains the Arabic sarcasm dataset (ArSarcasm) ☆24 Updated 4 years ago
- SemEval 2024 Task 1 : Textual Semantic Relatedness ☆25 Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language ☆73 Updated last year