krangelie / bias-in-german-nlg
Master thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies regard classification and bias mitigation triggers.
☆14Updated last month
Related projects ⓘ
Alternatives and complementary repositories for bias-in-german-nlg
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆21Updated last month
- Multilingual Open Text☆25Updated 3 weeks ago
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆13Updated 9 months ago
- ☆31Updated last year
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆82Updated last month
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago
- ☆21Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆46Updated 3 years ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆25Updated 2 months ago
- ☆16Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆56Updated 5 months ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆30Updated last year
- A survey of corpora for Germanic low-resource languages and dialects☆24Updated 3 months ago
- Semantically Structured Sentence Embeddings☆67Updated last month
- This is the code for loading the SenseBERT model, described in our paper from ACL 2020.☆44Updated last year
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16Updated 2 years ago
- LTG-Bert☆29Updated 10 months ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Experiments for XLM-V Transformers Integeration☆13Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 2 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆31Updated 2 years ago
- Tools for managing datasets for governance and training.☆78Updated 3 weeks ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT☆21Updated last year
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling☆31Updated 3 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆29Updated last year
- ☆73Updated 3 years ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆11Updated last year
- CrossRE: A Cross-Domain Dataset for Relation Extraction (Findings of EMNLP 2022)☆47Updated 3 months ago
- ☆12Updated 2 years ago