valentinhofmann / superbizarre
Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"
☆15 Updated 3 years ago
Related projects
Alternatives and complementary repositories for superbizarre
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi… ☆30 Updated last year
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation" ☆31 Updated 2 years ago
- Statistics on multilingual datasets ☆17 Updated 2 years ago
- ☆73 Updated 3 years ago
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann… ☆29 Updated 4 years ago
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models. ☆19 Updated 2 years ago
- ☆21 Updated 11 months ago
- ☆29 Updated last year
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu). ☆20 Updated 2 years ago
- A program to choose transfer languages for cross-lingual learning ☆70 Updated last year
- Pretraining scripts for BART transformer model ☆11 Updated last year
- ☆23 Updated 4 years ago
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation ☆15 Updated 2 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/ ☆81 Updated last month
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. ☆74 Updated last month
- This repository hosts the code for a tokenizer of tweets. ☆12 Updated 5 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆26 Updated 3 years ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets. ☆11 Updated 11 months ago
- Library for experimenting with state-of-the-art evaluation metrics like UScore ☆11 Updated last year
- Noise-robust de-duplication at scale ☆15 Updated last year
- ☆37 Updated 4 years ago
- How Contextual are Contextualized Word Representations? ☆39 Updated 4 years ago
- ☆27 Updated last year
- Code for the paper "Measuring Bias in Contextualized Word Representations" ☆36 Updated 5 years ago
- This repository holds the code for my master's thesis entitled "The Association of Gender Bias with BERT - Measuring, Mitigating and Cross-… ☆15 Updated 2 years ago
- A survey of corpora for Germanic low-resource languages and dialects ☆24 Updated 3 months ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 Updated 3 years ago
- Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation ☆17 Updated 3 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT ☆21 Updated last year