German Language Understanding Evaluation Benchmark @NAACL24
☆22Dec 11, 2025Updated 2 months ago
Alternatives and similar repositories for SuperGLEBer
Users that are interested in SuperGLEBer are comparing it to the libraries listed below
Sorting:
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- ☆13Dec 17, 2021Updated 4 years ago
- German Text Embedding Clustering Benchmark☆18Mar 15, 2024Updated last year
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆22Nov 10, 2024Updated last year
- German small and large versions of GPT2.☆20May 11, 2022Updated 3 years ago
- Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" https://arxiv.org/pdf/2112.14569.pdf☆20Dec 28, 2021Updated 4 years ago
- My NER Experiments with ModernBERT and Ettin☆26Jul 17, 2025Updated 7 months ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- ☆25Apr 28, 2020Updated 5 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆64Jul 29, 2024Updated last year
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated 2 years ago
- A repository containing the code for translating popular LLM benchmarks to German.☆32Aug 20, 2023Updated 2 years ago
- Extendable Scratch3 Programming Environment☆10Jan 24, 2026Updated last month
- The NLP Bias Identification Toolkit☆39Sep 8, 2023Updated 2 years ago
- ☆13Jan 25, 2026Updated last month
- Source code used in the blog☆12Feb 6, 2024Updated 2 years ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- A web application for studying Ancient Greek texts with integrated lexical, syntactic, and morphological analysis tools.☆20Dec 1, 2025Updated 3 months ago
- Coding utilities for quantitative legal studies☆14Dec 7, 2025Updated 3 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆48Oct 20, 2025Updated 4 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆110May 16, 2024Updated last year
- 0-Shot Tokenizer Transplant☆14May 16, 2025Updated 9 months ago
- A Python package for feature selection on a simulated data stream☆10Apr 21, 2022Updated 3 years ago
- suffix array construction and searching algorithms for in-memory binary data.☆12Sep 10, 2022Updated 3 years ago
- ☆11Oct 14, 2020Updated 5 years ago
- alternative remote for Lego Boost with Pythonista and iOS☆10Aug 27, 2017Updated 8 years ago
- Collection of description of concepts, procedures, and simple XSLT files for text processing, e.g. simplify InDesign documents (.idml) to…☆12Jan 9, 2020Updated 6 years ago
- A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆53Sep 10, 2024Updated last year
- Image clustering☆13Jan 22, 2022Updated 4 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- ☆10Sep 13, 2022Updated 3 years ago
- decontamination☆26Dec 3, 2025Updated 3 months ago
- ☆10Dec 17, 2020Updated 5 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- ☆11Apr 6, 2021Updated 4 years ago
- lib for model runnig for ml competitions☆10Jan 14, 2018Updated 8 years ago
- Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!☆11Jun 12, 2023Updated 2 years ago
- Shallow baseline models for text in TensorFlow☆12Jul 1, 2017Updated 8 years ago
- Poetry Corpora Annotated on Aesthetic Emotions☆12Aug 2, 2022Updated 3 years ago