chan0park / VoynaSlov
☆18Updated last year
Related projects: ⓘ
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆11Updated 9 months ago
- Noise-robust de-duplication at scale☆15Updated last year
- ☆26Updated last year
- Information and data related to the ProtestNews shared task at CASE @ ACL-IJCNLP 2021 workshop☆43Updated last year
- ☆49Updated 6 months ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆26Updated 2 years ago
- ☆19Updated 6 months ago
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆19Updated last year
- Code, data, and models for "POLITICS: Pretraining with Same-story Article Comparison for Ideology Prediction and Stance Detection"☆29Updated last month
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆23Updated 6 months ago
- Learning from Neighbors: Unsupervised Text Classification☆17Updated last year
- Dataset and code for directed sentiment analysis in news text.☆16Updated 3 years ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆26Updated 6 months ago
- Package to extract connotation frames☆78Updated 9 months ago
- ☆16Updated last year
- The Harvard USPTO Patent Dataset☆54Updated 9 months ago
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022☆12Updated last year
- ☆16Updated last year
- ConfliBERT: A Pre-trained Language Model for Political Conflict and Violence (NAACL 2022)☆16Updated 11 months ago
- ☆37Updated last year
- Testing and training detection models for emoji-based hate speech.☆23Updated 2 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆82Updated last year
- Twitter dataset for 2022 Russian and Ukrainian crisis☆50Updated last year
- The official implementation of the iConference 2022 paper "Identifying Machine-Paraphrased Plagiarism".☆16Updated last year
- ☆19Updated 2 years ago
- ☆12Updated 6 months ago
- Additional material for the paper "MoralStrength: Exploiting a Moral Lexicon and Embedding Similarity for Moral Foundations Prediction"☆53Updated last year
- ☆83Updated 2 years ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆15Updated 3 years ago
- ☆17Updated 6 years ago