webis-de / scidata22-stereo-scientific-text-reuse
☆11Updated last month
Alternatives and similar repositories for scidata22-stereo-scientific-text-reuse:
Users that are interested in scidata22-stereo-scientific-text-reuse are comparing it to the libraries listed below
- The official repository for the LREC 2022 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science …☆27Updated 2 years ago
- ☆21Updated last year
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆50Updated last year
- Source code and data for Like a Good Nearest Neighbor☆28Updated last week
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- A pipeline using LLMs for Knowledge Engineering, combining knowledge probing and Wikidata entity mapping.☆34Updated 3 weeks ago
- The official implementation of the iConference 2022 paper "Identifying Machine-Paraphrased Plagiarism".☆17Updated 2 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆12Updated 5 months ago
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Updated 3 years ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Pre-train Static Word Embeddings☆34Updated this week
- ☆19Updated 2 years ago
- ☆14Updated 3 months ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Updated 2 years ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement…☆16Updated 3 years ago
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognition☆15Updated last year
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 2 years ago
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations…☆18Updated 2 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆12Updated last year
- Data and code for the SciFact-Open task☆25Updated last year
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆12Updated last year
- ☆53Updated 3 years ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆22Updated last year
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆24Updated last year
- Multimodal extreme classification☆20Updated 8 months ago
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering)☆16Updated last year
- ☆22Updated 2 years ago
- Mining Legal Arguments in Court Decisions - Data and software☆65Updated last year
- A BERT-based application for reusable text classification at scale☆37Updated last year