Thiqah / ArabLegalEvalLinks
An effort to benchmark Arabic legal reasoning in foundation models.
☆13Updated 3 months ago
Alternatives and similar repositories for ArabLegalEval
Users that are interested in ArabLegalEval are comparing it to the libraries listed below
Sorting:
- ☆124Updated last year
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆335Updated 8 months ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆56Updated 2 years ago
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆85Updated last month
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Updated 8 months ago
- A curated collection of resources and repositories for Natural Language Processing (NLP) tasks specific to Darija, the Moroccan Arabic di…☆90Updated last year
- Arabic Tokenization Library. It provides many tokenization algorithms.☆107Updated last year
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Updated last year
- Arabic cleaning, normalization and segmentation library.☆70Updated last year
- An open collection of implementation tips, tricks and resources for training large language models☆479Updated 2 years ago
- Deliver safe & effective language models☆536Updated this week
- ☆42Updated 2 years ago
- 🦖 X—LLM: Cutting Edge & Easy LLM Finetuning☆406Updated last year
- build gpt-index using chatgpt and sentence-transformers☆14Updated 2 years ago
- Easily embed, cluster and semantically label text datasets☆567Updated last year
- Scripts to finetune the official implementation of OpenAI's Whisper model☆23Updated 2 months ago
- The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.☆176Updated 2 months ago
- AraT5: Text-to-Text Transformers for Arabic Language Understanding☆91Updated last year
- Resources relating to the DLAI event: https://www.youtube.com/watch?v=eTieetk2dSw☆188Updated 2 years ago
- Arabic nested named entity recognition☆40Updated 5 months ago
- ☆109Updated 8 months ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆510Updated last year
- ☆17Updated 2 years ago
- Multilingual/multidomain question generation datasets, models, and python library for question generation.☆360Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆13Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆184Updated 2 years ago
- Let's build better datasets, together!☆263Updated 8 months ago
- TODa: Tamazight Open Dataset☆16Updated 7 months ago
- LLM Workshop by Sourab Mangrulkar☆394Updated last year
- ☆244Updated last year