dell-research-harvard / NEWS-COPY
Noise-robust de-duplication at scale
☆15 · Updated last year
Related projects:
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets. ☆11 · Updated 9 months ago
- Learning from Neighbors: Unsupervised Text Classification ☆17 · Updated last year
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words" ☆15 · Updated 3 years ago
- ☆16 · Updated last year
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022 ☆12 · Updated last year
- ☆16 · Updated last year
- ConfliBERT: A Pre-trained Language Model for Political Conflict and Violence (NAACL 2022) ☆16 · Updated 11 months ago
- ☆15 · Updated 7 years ago
- Repository for the paper Us vs. Them: A Dataset of Populist Attitudes, News Bias and Emotions ☆16 · Updated 3 months ago
- Package to extract connotation frames ☆78 · Updated 9 months ago
- [COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models ☆13 · Updated last year
- ☆19 · Updated 6 months ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers. ☆50 · Updated last year
- Code for "Dynamic Contextualized Word Embeddings" ☆28 · Updated 2 years ago
- ☆13 · Updated 2 years ago
- ☆17 · Updated 6 years ago
- ☆22 · Updated last year
- Package for computing causal effects of text (as treatment) ☆67 · Updated 2 years ago
- Information and data related to the ProtestNews shared task at CASE @ ACL-IJCNLP 2021 workshop ☆43 · Updated last year
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding" ☆13 · Updated 2 years ago
- Semantically Structured Sentence Embeddings ☆65 · Updated 10 months ago
- ☆49 · Updated 6 months ago
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021. ☆19 · Updated last year
- Find text features that are most related to an outcome, controlling for confounds. ☆60 · Updated last month
- ☆54 · Updated 2 years ago
- Data for evaluating gender bias in coreference resolution systems. ☆65 · Updated 5 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆39 · Updated 2 years ago
- Repository for the CommonLit Ease of Readability Corpus ☆18 · Updated 5 months ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in … ☆26 · Updated 2 years ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method. ☆12 · Updated 5 years ago