webis-de / scidata22-stereo-scientific-text-reuseLinks
β11Updated 11 months ago
Alternatives and similar repositories for scidata22-stereo-scientific-text-reuse
Users that are interested in scidata22-stereo-scientific-text-reuse are comparing it to the libraries listed below
Sorting:
- π« SpaCy wrapper for ConceptNet π«β95Updated 2 years ago
- StAtutory Reasoning Assessmentβ14Updated 2 years ago
- Semantically Structured Sentence Embeddingsβ67Updated last year
- β22Updated 3 years ago
- A BERT-based application for reusable text classification at scaleβ38Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.β54Updated 2 years ago
- β22Updated 9 months ago
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolutionβ12Updated 2 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.β21Updated last year
- ParaNames: A multilingual resource for parallel namesβ37Updated last year
- A module to compute textual lexical richness (aka lexical diversity).β110Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitterβ111Updated last year
- β55Updated last year
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal β¦β32Updated 4 years ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"β14Updated 3 years ago
- Mining Legal Arguments in Court Decisions - Data and softwareβ71Updated 2 years ago
- πΎ Universal, customizable and deployable fine-grained evaluation for text generation.β24Updated 2 years ago
- An implementation of GrASP (Shnarch et. al., 2017)β22Updated 3 years ago
- A Wikipedia-based summarization datasetβ14Updated 2 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 qβ¦β89Updated last year
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweebanβ¦β105Updated last year
- Calculate Krippendorff's Alpha on any DataFrameβ42Updated 2 years ago
- SciWING is a modern toolkit for scientific document processing from WING-NUSβ63Updated 2 years ago
- Creating class-based TF-IDF matricesβ90Updated 3 years ago
- β34Updated 2 years ago
- Source code and data for Like a Good Nearest Neighborβ30Updated 9 months ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2β¦β68Updated 2 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)β71Updated 2 years ago
- β53Updated last year
- Open source library for few shot NLPβ78Updated 2 years ago