webis-de / scidata22-stereo-scientific-text-reuse
☆11Updated 6 months ago
Related projects: ⓘ
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆11Updated 9 months ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement…☆16Updated 3 years ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆13Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆50Updated last year
- A pipeline using LLMs for Knowledge Engineering, combining knowledge probing and Wikidata entity mapping.☆34Updated 10 months ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆11Updated 2 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 7 months ago
- The official implementation of the iConference 2022 paper "Identifying Machine-Paraphrased Plagiarism".☆16Updated last year
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆11Updated last year
- StAtutory Reasoning Assessment☆11Updated last year
- ☆19Updated last year
- ☆19Updated 2 years ago
- Semantically Structured Sentence Embeddings☆65Updated 10 months ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆14Updated last year
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆23Updated 6 months ago
- ☆26Updated last year
- Code/data for MARG (multi-agent review generation)☆24Updated 4 months ago
- The official repository for the LREC 2022 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science …☆27Updated last year
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆18Updated 10 months ago
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022☆12Updated last year
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆63Updated last year
- Implementation of the Paper "Towards an Automated Argument Mining Pipeline to Transform Plain Text to Argument Graphs"☆21Updated 7 months ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆31Updated 3 years ago
- Tool for the Automatic Assessment of Lexical Diversity☆11Updated 3 years ago
- Data and code for the SciFact-Open task☆24Updated 9 months ago
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognition☆15Updated last year
- Wikipedia based dataset to train relationship classifiers and fact extraction models☆24Updated 3 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆39Updated 2 years ago
- [COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models☆13Updated last year