webis-de / scidata22-stereo-scientific-text-reuse
☆11Updated 4 months ago
Alternatives and similar repositories for scidata22-stereo-scientific-text-reuse:
Users that are interested in scidata22-stereo-scientific-text-reuse are comparing it to the libraries listed below
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Updated 2 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆17Updated 8 months ago
- ☆22Updated 3 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆52Updated last year
- Semantically Structured Sentence Embeddings☆66Updated 6 months ago
- ☆54Updated last year
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆12Updated last year
- Source code and data for Like a Good Nearest Neighbor☆28Updated 3 months ago
- ☆23Updated 3 months ago
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognition☆15Updated last year
- A BERT-based application for reusable text classification at scale☆38Updated last year
- HDBSCAN Tuning for BERTopic Models☆45Updated last year
- MultiCite code and data. Models are available on Huggingface.☆31Updated 2 years ago
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆12Updated last year
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- StAtutory Reasoning Assessment☆13Updated 2 years ago
- REMERGE - Multi-Word Expression discovery algorithm☆14Updated 2 years ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Tool for the Automatic Assessment of Lexical Diversity☆11Updated 4 years ago
- An easy-to-use API for analyzing INCEpTION annotation projects.☆17Updated last year
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆22Updated last year
- Creating class-based TF-IDF matrices☆83Updated 2 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆12Updated 9 months ago
- ☆17Updated last year
- This repository hosts the code for a tokenizer of tweets.☆12Updated 6 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.☆58Updated 8 months ago
- The official implementation of the iConference 2022 paper "Identifying Machine-Paraphrased Plagiarism".☆17Updated 2 years ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement…☆16Updated 3 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 2 years ago
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Updated 4 months ago