This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.
☆26Dec 9, 2024Updated last year
Alternatives and similar repositories for peacock
Users that are interested in peacock are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆128Mar 3, 2024Updated 2 years ago
- ☆38Feb 7, 2026Updated 2 months ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆12Feb 5, 2020Updated 6 years ago
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆14Apr 3, 2022Updated 4 years ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆46Apr 3, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Explore the content of Arabic text datasets.☆18May 23, 2022Updated 3 years ago
- Manazir OCR — Arabic-first, optics-inspired multi-model OCR. Extracts high-quality text and layout (HTML/Markdown) from Arabic documents …☆41Nov 2, 2025Updated 5 months ago
- ☆11Jul 7, 2023Updated 2 years ago
- Leverage Large Language Models to generate and execute code dynamically through an intuitive and easy-to-use API!☆17Mar 2, 2024Updated 2 years ago
- ☆29Sep 17, 2024Updated last year
- Shami Dialect Corpus (SDC)☆29Feb 13, 2018Updated 8 years ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Apr 20, 2025Updated 11 months ago
- This repository contains code for exploratory data analysis and machine learning on different datasets.☆13Nov 6, 2022Updated 3 years ago
- Large Language Models: In this repository Language models are introduced covering both theoretical and practical aspects.☆393Oct 9, 2025Updated 6 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- All my experiments with the various transformers and various transformer frameworks available☆14Apr 30, 2021Updated 4 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- Maha is a text processing library specially developed to deal with Arabic text.☆214Mar 16, 2026Updated 3 weeks ago
- Synthetic Data Generation for Evaluation☆14Feb 21, 2025Updated last year
- This GitHub repository contains the source code for my Udemy course, where we focus on databases, query builders, Eloquent, and relations…☆16Mar 29, 2023Updated 3 years ago
- Code for the MTEB Arena☆24Jul 2, 2025Updated 9 months ago
- A central hub for translating stuff into Arabic (Join our Discord Server, if you want to help)☆49Jan 12, 2026Updated 3 months ago
- Homeworks, Midterm, & Capstone from ML BookCamp☆16Jan 28, 2022Updated 4 years ago
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆20Apr 6, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Prediction of the activity of molecules/ligands that have been tested to bind or not bind to Beta-Lactamases using machine learning cl…☆10Mar 5, 2026Updated last month
- ☆10Jul 21, 2023Updated 2 years ago
- This model detects arabic fonts (نسخ, رقعة) given a picture of the text, Live https://calbot.hawzen.me/☆17May 27, 2023Updated 2 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 8 months ago
- ☆15Mar 6, 2021Updated 5 years ago
- ☆10Mar 30, 2026Updated last week
- ☆12Dec 24, 2024Updated last year
- Tunisian Arabish Corpus☆12Mar 12, 2024Updated 2 years ago
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆15Oct 20, 2023Updated 2 years ago
- ☆12Dec 6, 2021Updated 4 years ago
- lecture materials☆15Mar 11, 2026Updated last month
- WASM Based Document scanner and processor☆11Oct 12, 2023Updated 2 years ago
- Implementation of Contrastive Predictive Coding for Natural Language☆10Sep 16, 2020Updated 5 years ago
- Here I have designed a neural network with image inputs of size(300x300). The network is trained with a set of training images of fingerp…☆14Aug 7, 2020Updated 5 years ago
- Detection of Emotion and its cause from text☆19Jun 2, 2020Updated 5 years ago