This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.
☆26Dec 9, 2024Updated last year
Alternatives and similar repositories for peacock
Users that are interested in peacock are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python intefrace for evaluation on chatgpt models☆19Feb 13, 2024Updated 2 years ago
- ☆38Feb 7, 2026Updated 3 months ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆12Feb 5, 2020Updated 6 years ago
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆14Apr 3, 2022Updated 4 years ago
- FrugalScore is an approach to learn a fixed, low cost version of any expensive NLG metric, while retaining most of its original performan…☆16Sep 21, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆46Apr 3, 2025Updated last year
- Explore the content of Arabic text datasets.☆18May 23, 2022Updated 4 years ago
- Manazir OCR — Arabic-first, optics-inspired multi-model OCR. Extracts high-quality text and layout (HTML/Markdown) from Arabic documents …☆41Nov 2, 2025Updated 6 months ago
- ☆37May 28, 2023Updated 2 years ago
- [ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding☆68May 24, 2025Updated 11 months ago
- أسئلة باللغة العربية تركز على الثقافة السعودية تم اختبارها على عدد من النماذج اللغوية الضخمة LLMs☆18Jan 22, 2025Updated last year
- ☆29Sep 17, 2024Updated last year
- Egyptian ID Card Recognition System 💳 A Python-based application to detect and process Egyptian ID cards using YOLO and EasyOCR.☆44Feb 22, 2025Updated last year
- ☆45Aug 11, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The GPT-4 function calls used in everchanging quest for the HF game jam☆10Jul 9, 2023Updated 2 years ago
- Large Language Models: In this repository Language models are introduced covering both theoretical and practical aspects.☆393Oct 9, 2025Updated 7 months ago
- ☆11Dec 29, 2024Updated last year
- [NeurIPS 2021] "Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models" by Boxin Wang*, Chejian Xu*, Shuoh…☆13Apr 3, 2023Updated 3 years ago
- [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages☆17Apr 14, 2026Updated last month
- All my experiments with the various transformers and various transformer frameworks available☆14Apr 30, 2021Updated 5 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- Maha is a text processing library specially developed to deal with Arabic text.☆216Updated this week
- Synthetic Data Generation for Evaluation☆14Feb 21, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Arabic nested named entity recognition☆46Mar 10, 2025Updated last year
- ☆13Mar 16, 2025Updated last year
- Code for Arabic Nougat☆52Nov 28, 2024Updated last year
- Code for the MTEB Arena☆24Jul 2, 2025Updated 10 months ago
- see github.com/understanding-search/maze-transformer☆10Dec 8, 2023Updated 2 years ago
- A central hub for translating stuff into Arabic (Join our Discord Server, if you want to help)☆50Jan 12, 2026Updated 4 months ago
- [NeurIPS 2024] 🕸 GlotCC Dataset and Pipline☆20Apr 6, 2025Updated last year
- ☆13Jul 29, 2024Updated last year
- ☆10Nov 6, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This model detects arabic fonts (نسخ, رقعة) given a picture of the text, Live https://calbot.hawzen.me/☆18May 27, 2023Updated 2 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 10 months ago
- ☆12Jun 5, 2019Updated 6 years ago
- ☆10Mar 30, 2026Updated last month
- The code for the Sales Dashboard demo☆16May 19, 2025Updated last year
- Tunisian Arabish Corpus☆12Mar 12, 2024Updated 2 years ago
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year