GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way
β18Nov 4, 2025Updated 4 months ago
Alternatives and similar repositories for GlotEval
Users that are interested in GlotEval are comparing it to the libraries listed below
Sorting:
- KnowMAN: Weakly Supervised Multinomial Adversarial Networksβ12Nov 9, 2021Updated 4 years ago
- πΈ GlotWeb: Web Indexing for Minority Languages (WWW 2026)β17Feb 27, 2026Updated last week
- π Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignmentβ11Apr 6, 2025Updated 11 months ago
- πΈ GlotCC Dataset and Pipline -- NeurIPS 2024β20Apr 6, 2025Updated 11 months ago
- Collection of academic works in natural language processing, computational linguistics, and computational cognitive science that study thβ¦β22Mar 20, 2024Updated last year
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resourceβ¦β26Feb 16, 2026Updated 2 weeks ago
- β44Feb 11, 2026Updated 3 weeks ago
- Code for "BERTifying the Hidden Markov Model for Multi-Source Weakly Supervised Named Entity Recognition"β32Jun 20, 2023Updated 2 years ago
- Synthetic Text Dataset Generation for LLM projectsβ56Updated this week
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"β30Apr 2, 2022Updated 3 years ago
- Finite-state script normalization and processing utilitiesβ46Feb 25, 2026Updated last week
- β10Nov 12, 2024Updated last year
- β10Apr 6, 2023Updated 2 years ago
- Code for EMNLP 2021 main conference paper "Dynamic Knowledge Distillation for Pre-trained Language Models"β41Aug 9, 2022Updated 3 years ago
- This Repository contains the Demo Script, Code for all the sessions which I will be doing in Year 2025β12Jun 28, 2025Updated 8 months ago
- NamSor API v2 R SDK - classify personal names accurately by gender, country of origin, or ethnicity.β12Mar 15, 2021Updated 4 years ago
- A longitudinal dataset for academic literature, including papers, metadata, and citation graphs, Also available on π€ HuggingFace and Kagβ¦β16Sep 6, 2025Updated 6 months ago
- Residual Quantization Autoencoder, used for interpreting LLMsβ14Jan 1, 2025Updated last year
- Curated list of awesome datasets for various table understanding tasksβ18Sep 5, 2025Updated 6 months ago
- HRA ASCT+B Reporterβ10Nov 20, 2025Updated 3 months ago
- β11Jul 18, 2018Updated 7 years ago
- π΄π Data on Members of the 116th U.S. Congressβ10Dec 11, 2019Updated 6 years ago
- Crawler based on a modified browser to detect online tracking.β11Jul 19, 2023Updated 2 years ago
- The Tifinagh Hand-written Letters Datasetβ12Feb 17, 2024Updated 2 years ago
- β10Oct 2, 2024Updated last year
- Linear Attention for Efficient Bidirectional Sequence Modelingβ15May 13, 2025Updated 9 months ago
- β15May 12, 2025Updated 9 months ago
- Utilities to gather software metrics from tools (SONAR, etc) and store them into ElasticSearch for later display using Kibana.β11Dec 31, 2017Updated 8 years ago
- A repository for resources relating to NLP in the Balochi languageβ19Jun 3, 2023Updated 2 years ago
- β45Jul 5, 2022Updated 3 years ago
- β13Nov 28, 2025Updated 3 months ago
- decontaminationβ26Updated this week
- NERO-nlp is a PyPI package for biomedical Named Entity (Recognition) Ontologyβ12Oct 1, 2020Updated 5 years ago
- Repository for public code and data associated with the paper "Fake News on Twitter During the 2016 U.S. Presidential Electionβ12Dec 5, 2019Updated 6 years ago
- This is a package to implement the Robust Latent Dirichlet Approach in R.β10Apr 25, 2019Updated 6 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Modelsβ11Jan 19, 2024Updated 2 years ago
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ecβ¦β10Apr 14, 2025Updated 10 months ago
- β10Jun 3, 2017Updated 8 years ago
- Sentiment Analysis and Figurative Language in French Tweetsβ10Apr 16, 2019Updated 6 years ago