webis-de / scidata22-stereo-scientific-text-reuseLinks
☆11Updated 9 months ago
Alternatives and similar repositories for scidata22-stereo-scientific-text-reuse
Users that are interested in scidata22-stereo-scientific-text-reuse are comparing it to the libraries listed below
Sorting:
- StAtutory Reasoning Assessment☆14Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Updated 3 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆54Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆30Updated 8 months ago
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 3 years ago
- Mining Legal Arguments in Court Decisions - Data and software☆69Updated 2 years ago
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extraction…☆105Updated last year
- Semantically Structured Sentence Embeddings☆68Updated 11 months ago
- ☆37Updated 3 weeks ago
- Open source library for few shot NLP☆79Updated 2 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆19Updated last year
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆32Updated 4 years ago
- Creating class-based TF-IDF matrices☆89Updated 2 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆68Updated 2 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆71Updated 2 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆45Updated last year
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆105Updated last year
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al.☆97Updated 2 years ago
- Repository for the paper "MultiNERD: A Multilingual, Multi-Genre and Fine-Grained Dataset for Named Entity Recognition (and Disambiguatio…☆45Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- RaKUn 2.0 - A fast keyword detection algorithm☆68Updated last month
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆89Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- ParaNames: A multilingual resource for parallel names☆36Updated last year
- A BERT-based application for reusable text classification at scale☆38Updated 2 years ago
- ☆55Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions☆55Updated 2 years ago
- Experimental code used in pre-training the KBIR and KeyBART models☆26Updated 3 years ago
- SciWING is a modern toolkit for scientific document processing from WING-NUS☆63Updated 2 years ago