sotlampr / universal-joy
A Dataset and Results for Classifying Emotions Across Languages
☆10 · Updated 3 years ago
Alternatives and similar repositories for universal-joy:
Users who are interested in universal-joy compare it to the repositories listed below.
- Data and code repository of "Multilingual Fairness Evaluation for Hate Speech Detection". LREC 2020. ☆19 · Updated 2 years ago
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis" ☆56 · Updated last month
- Röttger et al. (WOAH at NAACL 2022): "Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models" ☆14 · Updated 2 years ago
- Statistics on multilingual datasets ☆17 · Updated 2 years ago
- Automatically detect errors in annotated corpora. ☆46 · Updated last year
- Efficient Sentence Embedding via Semantic Subspace Analysis ☆14 · Updated 4 years ago
- ☆22 · Updated 3 years ago
- XED multilingual emotion datasets ☆57 · Updated last year
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering ☆38 · Updated 3 years ago
- Code for the paper "Measuring Bias in Contextualized Word Representations" ☆35 · Updated 5 years ago
- Toxicity Detection in Context: Assuming that the comment exists in a thread and that the parent comment or/and the discussion topic are e… ☆27 · Updated last year
- ☆17 · Updated 6 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, … ☆78 · Updated 9 months ago
- Training Temporal Word Embeddings with a Compass ☆64 · Updated 2 years ago
- Multilingual Open Text ☆25 · Updated 2 months ago
- ☆19 · Updated 2 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations". ☆63 · Updated 4 years ago
- ☆15 · Updated 6 years ago
- Contains data, format checker, scorer and baselines for the CLEF2020-CheckThat! Task 1. ☆20 · Updated last year
- [COLING2020] A challenge dataset for Person SenTiment analysis in news domain. ☆11 · Updated 2 years ago
- ☆24 · Updated 5 years ago
- Code for Embeddings-Based Clustering for Target Specific Stances ☆24 · Updated 2 years ago
- ☆22 · Updated last year
- Pretraining scripts for BART transformer model ☆11 · Updated last year
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish. ☆10 · Updated 4 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization. ☆11 · Updated 4 years ago
- Code for the paper "Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora", ACL 2020. ☆18 · Updated 4 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆26 · Updated 3 years ago
- ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding" by Dongyeop Kang and Eduard Hovy ☆13 · Updated 3 years ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant. ☆31 · Updated 4 years ago