LSX-UniWue / SuperGLEBerView external linksLinks
German Language Understanding Evaluation Benchmark @NAACL24
☆22Dec 11, 2025Updated 2 months ago
Alternatives and similar repositories for SuperGLEBer
Users that are interested in SuperGLEBer are comparing it to the libraries listed below
Sorting:
- German Text Embedding Clustering Benchmark☆18Mar 15, 2024Updated last year
- PathPiece tokenizer☆13Nov 10, 2024Updated last year
- ☆13Dec 17, 2021Updated 4 years ago
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models☆35Oct 1, 2025Updated 4 months ago
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆22Nov 10, 2024Updated last year
- Implementation of the paper "Fine-Tuning Transformers: Vocabulary Transfer" https://arxiv.org/pdf/2112.14569.pdf☆20Dec 28, 2021Updated 4 years ago
- My NER Experiments with ModernBERT and Ettin☆26Jul 17, 2025Updated 6 months ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- Small python package to measure OCR quality and other related metrics.☆27Feb 19, 2024Updated last year
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆64Jul 29, 2024Updated last year
- A repository containing the code for translating popular LLM benchmarks to German.☆31Aug 20, 2023Updated 2 years ago
- The NLP Bias Identification Toolkit☆39Sep 8, 2023Updated 2 years ago
- Extendable Scratch3 Programming Environment☆10Jan 24, 2026Updated 3 weeks ago
- ☆12Jan 25, 2026Updated 2 weeks ago
- A web application for studying Ancient Greek texts with integrated lexical, syntactic, and morphological analysis tools.☆19Dec 1, 2025Updated 2 months ago
- Source code used in the blog☆12Feb 6, 2024Updated 2 years ago
- speech to text gui for different (mostly Whisper, also Voxtral) models and backends, including whisper.cpp, mlx-whisper, faster-whisper, …☆11Dec 7, 2025Updated 2 months ago
- ☆10Oct 2, 2024Updated last year
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆48Oct 20, 2025Updated 3 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111May 16, 2024Updated last year
- Digitale Geisteswissenschaften rund um Graphentechnologien☆10Updated this week
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- decontamination☆24Dec 3, 2025Updated 2 months ago
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices☆10Aug 3, 2020Updated 5 years ago
- ☆13Nov 28, 2025Updated 2 months ago
- ☆11Apr 6, 2021Updated 4 years ago
- lib for model runnig for ml competitions☆10Jan 14, 2018Updated 8 years ago
- Collection of description of concepts, procedures, and simple XSLT files for text processing, e.g. simplify InDesign documents (.idml) to…☆12Jan 9, 2020Updated 6 years ago
- An apa7 template for quarto/posit☆12Jan 25, 2023Updated 3 years ago
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec…☆11Apr 14, 2025Updated 10 months ago
- ☆10Sep 13, 2022Updated 3 years ago
- A list of awesome open source projects in the machine learning field, who's developers are mainly based in Germany☆52Sep 10, 2024Updated last year
- Ranger helps you see the forest among the trees - Ranger is an effect-size meta analysis library creating beautiful forest plots!☆11Jun 12, 2023Updated 2 years ago
- Data and code: "Answering legal questions from laymen in German civil law system", Büttner & Habernal, EACL'24☆13Mar 2, 2024Updated last year
- Poetry Corpora Annotated on Aesthetic Emotions☆12Aug 2, 2022Updated 3 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- Image clustering☆13Jan 22, 2022Updated 4 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- Shallow baseline models for text in TensorFlow☆12Jul 1, 2017Updated 8 years ago