UBC-NLP / peacock
This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.
☆24Updated 2 months ago
Alternatives and similar repositories for peacock:
Users that are interested in peacock are comparing it to the libraries listed below
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- ☆120Updated 11 months ago
- ☆39Updated 6 months ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆34Updated 11 months ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆16Updated 6 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 8 months ago
- أسئلة باللغة العربية تركز على الثقافة السعودية تم اختبارها على عدد من النماذج اللغوية الضخمة LLMs☆13Updated 3 weeks ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆52Updated last year
- Code for Arabic Nougat☆38Updated 2 months ago
- Arabic nested named entity recognition☆33Updated 9 months ago
- The official implementation of CATT Arabic diacritization models.☆39Updated last month
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆93Updated 2 months ago
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and…☆95Updated last week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 7 months ago
- ArabicaQA: Comprehensive Dataset for Arabic Question Answering accepted at SIGIR 2024☆13Updated 6 months ago
- مستودع الأوراق المسحية في معالجة اللغة العربية (أسبر) A Repository for survey and review papers in Arabic Natural Language processing (AN…☆78Updated 2 months ago
- This repo is for semantic search app to search over Quran tafsir books☆24Updated 7 months ago
- ☆28Updated 2 months ago
- Seq2Seq-based open domain empathetic conversational model for Arabic: Dataset & Model☆57Updated 7 months ago
- Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques☆143Updated 2 weeks ago
- 🧰 The AutoTokenizer that TikToken always needed -- Load any tokenizer with TikToken now! ✨☆37Updated last month
- Arabic cleaning, normalization and segmentation library.☆66Updated last year
- Chunk your text using gpt4o-mini more accurately☆43Updated 6 months ago
- Arabic Tokenization Library. It provides many tokenization algorithms.☆100Updated last year
- AraT5: Text-to-Text Transformers for Arabic Language Understanding☆88Updated 9 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆61Updated this week
- Efficiently find the best-suited language model (LM) for your NLP task☆116Updated this week
- Set of scripts to finetune LLMs☆36Updated 10 months ago
- ☆140Updated 7 months ago