dell-research-harvard / NEWS-COPY
Noise-robust de-duplication at scale
☆17Updated last year
Alternatives and similar repositories for NEWS-COPY:
Users that are interested in NEWS-COPY are comparing it to the libraries listed below
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022☆13Updated 2 years ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆51Updated last year
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆12Updated last year
- Automatically detect errors in annotated corpora.☆47Updated last year
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆15Updated 3 years ago
- Code for "Dynamic Contextualized Word Embeddings"☆31Updated 3 years ago
- ☆26Updated 6 months ago
- X-SRL Dataset. Including the code for the SRL annotation projection tool and an out-of-the-box word alignment tool based on Multilingual …☆15Updated 3 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated last year
- Data for the HIPE 2022 shared task.☆16Updated last year
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognition☆15Updated last year
- Learning from Neighbors: Unsupervised Text Classification☆17Updated 2 years ago
- The official repository for the LREC 2022 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science …☆27Updated 2 years ago
- ☆15Updated 7 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆83Updated last week
- Package to extract connotation frames☆83Updated last year
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆80Updated 10 months ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆12Updated 5 years ago
- ☆41Updated 4 years ago
- ☆16Updated last month
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).☆13Updated 8 months ago
- How Contextual are Contextualized Word Representations?☆41Updated 4 years ago
- ☆12Updated 3 years ago
- ☆85Updated 3 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆27Updated 3 years ago
- For our EMNLP 2020 paper “Are ‘Undocumented Workers’ the Same as ‘Illegal Aliens’? Disentangling Denotation and Connotation in Vector Spa…☆11Updated 4 years ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆175Updated last year
- Code for the paper "Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora", ACL 2020.☆18Updated 4 years ago
- Information and data related to the ProtestNews shared task at CASE @ ACL-IJCNLP 2021 workshop☆43Updated 2 years ago