MaLA-LM / GlotEvalLinks
GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way
☆16Updated last month
Alternatives and similar repositories for GlotEval
Users that are interested in GlotEval are comparing it to the libraries listed below
Sorting:
- ☆37Updated 4 months ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Updated 4 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆63Updated last year
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆71Updated last year
- Benchmarking Large Language Models☆104Updated 5 months ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆135Updated last year
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Updated 8 months ago
- ParaNames: A multilingual resource for parallel names☆37Updated last year
- State-of-the-art paired encoder and decoder models (17M-1B params)☆53Updated 4 months ago
- TimeLMs: Diachronic Language Models from Twitter☆111Updated last year
- Data for evaluating gender bias in coreference resolution systems.☆81Updated 6 years ago
- [COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models☆14Updated 2 years ago
- German Text Embedding Clustering Benchmark☆18Updated last year
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆85Updated last year
- Resources for cultural NLP research☆110Updated 2 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆137Updated 11 months ago
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆93Updated 2 years ago
- Repository for research in the field of Responsible NLP at Meta.☆204Updated 7 months ago
- ☆52Updated 2 years ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆224Updated last year
- SeeGULL is a broad-coverage stereotype dataset in English containing stereotypes about identity groups spanning 178 countries across 8 di…☆37Updated 2 years ago
- ☆24Updated 2 years ago
- Multidocument Summarization for Literature Review Shared Task 2022☆30Updated 3 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆106Updated last year
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆30Updated 4 years ago
- ☆39Updated last year
- Semantically Structured Sentence Embeddings☆69Updated last year
- A survey of corpora for Germanic low-resource languages and dialects☆26Updated last year
- Project Debater Early Access Program Tutorial☆25Updated last week