facebookresearch / belebeleLinks
Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.
☆330Updated 6 months ago
Alternatives and similar repositories for belebele
Users that are interested in belebele are comparing it to the libraries listed below
Sorting:
- Build, evaluate, understand, and fix LLM-based apps☆489Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆261Updated 11 months ago
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆187Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆219Updated last year
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆138Updated 3 weeks ago
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.☆188Updated 10 months ago
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI☆503Updated 5 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 7 months ago
- ☆520Updated 7 months ago
- Pipeline for pulling and processing online language model pretraining data from the web☆178Updated last year
- [ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d…☆299Updated last year
- Let's build better datasets, together!☆260Updated 6 months ago
- The official evaluation suite and dynamic data release for MixEval.☆243Updated 7 months ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆162Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆208Updated last month
- Website for hosting the Open Foundation Models Cheat Sheet.☆267Updated last month
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆114Updated 2 years ago
- This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c…☆222Updated 2 years ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆304Updated last year
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆302Updated 2 years ago
- ☆455Updated last year
- Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆464Updated 2 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆130Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆91Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples☆213Updated last year
- Scaling Data-Constrained Language Models☆335Updated 9 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆699Updated last year
- batched loras☆343Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆124Updated 10 months ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year