Master thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies regard classification and bias mitigation triggers.
☆16Sep 25, 2024Updated last year
Alternatives and similar repositories for bias-in-german-nlg
Users that are interested in bias-in-german-nlg are comparing it to the libraries listed below
Sorting:
- German Text Embedding Clustering Benchmark☆18Mar 15, 2024Updated last year
- This repository holds the code for my master thesis entitles "The Association of Gender Bias with BERT - Measuring, Mitigating and Cross-…☆18Sep 19, 2022Updated 3 years ago
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models☆36Oct 1, 2025Updated 5 months ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆22Feb 14, 2024Updated 2 years ago
- German dataset for DPR model training☆19Jul 21, 2024Updated last year
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆43Feb 11, 2026Updated 3 weeks ago
- This is a german ELMo deep contextualized word representation. It is trained on a special German Wikipedia Text Corpus.☆28Dec 15, 2019Updated 6 years ago
- Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"☆30Aug 17, 2019Updated 6 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆88Sep 12, 2024Updated last year
- German GPT-2 model☆32Aug 17, 2021Updated 4 years ago
- Only for real elders who really want to experience old AI Dungeon experience.☆10Nov 8, 2020Updated 5 years ago
- Brent's code for FTIR deconvolution☆12Dec 17, 2020Updated 5 years ago
- ☆10Oct 2, 2024Updated last year
- Check if a datum exists without reading its value☆12Feb 28, 2026Updated last week
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec…☆10Apr 14, 2025Updated 10 months ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Poetry Corpora Annotated on Aesthetic Emotions☆12Aug 2, 2022Updated 3 years ago
- 0-Shot Tokenizer Transplant☆14May 16, 2025Updated 9 months ago
- decontamination☆26Updated this week
- Collection of description of concepts, procedures, and simple XSLT files for text processing, e.g. simplify InDesign documents (.idml) to…☆12Jan 9, 2020Updated 6 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices☆10Aug 3, 2020Updated 5 years ago
- Fork of node-named (DNS server in node.js) with AXFR, IXFR, TCP support and more☆11Nov 3, 2021Updated 4 years ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 2 years ago
- Data structure that maps entries to numeric ids☆14Aug 16, 2015Updated 10 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- Digitale Geisteswissenschaften rund um Graphentechnologien☆10Feb 12, 2026Updated 3 weeks ago
- ☆10Dec 17, 2020Updated 5 years ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Social Media Scraping tool (Instagram,Facebook,Twitter)☆10Jan 22, 2023Updated 3 years ago
- ☆10Jan 23, 2023Updated 3 years ago
- ☆10Sep 13, 2022Updated 3 years ago
- Personal portfolio with React☆17Mar 30, 2022Updated 3 years ago
- AI Reddit bot that scrapes subreddits for questions, conducts research, and posts automated answers to help users with relevant informati…☆17Sep 13, 2024Updated last year
- A library for language transfer methods and algorithms.☆16Feb 6, 2026Updated last month
- suffix array construction and searching algorithms for in-memory binary data.☆12Sep 10, 2022Updated 3 years ago
- ☆11Oct 27, 2025Updated 4 months ago
- ☆13Nov 28, 2025Updated 3 months ago