malteos / scincl
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)
☆67Updated 2 years ago
Alternatives and similar repositories for scincl:
Users that are interested in scincl are comparing it to the libraries listed below
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆51Updated last year
- Dataset accompanying the SPECTER model☆133Updated 2 years ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Updated 2 years ago
- ☆84Updated 9 months ago
- MultiCite code and data. Models are available on Huggingface.☆29Updated 2 years ago
- SciRepEval benchmark training and evaluation scripts☆72Updated 9 months ago
- Dataset, models, and code for paper "CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation", …☆33Updated 2 years ago
- Code and model checkpoints for the MultiVerS model for scientific claim verification.☆45Updated last year
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here.☆27Updated last year
- ☆53Updated 3 years ago
- cRocoDiLe is a dataset extraction tool for Relation Extraction using Wikipedia and Wikidata presented in REBEL (EMNLP 2021).☆66Updated last year
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆83Updated 2 weeks ago
- Multidocument Summarization for Literature Review Shared Task 2022☆29Updated 2 years ago
- The official repository for the LREC 2022 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science …☆27Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- [WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations☆87Updated 3 years ago
- ☆18Updated 2 years ago
- ☆52Updated 11 months ago
- Dense hybrid representations for text retrieval☆62Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆74Updated 3 years ago
- ☆68Updated 3 years ago
- A Test Collection of Computer Science Papers for Faceted Query by Example☆21Updated 3 years ago
- S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering)☆16Updated last year
- A Python Commonsense Knowledge Inference Toolkit☆63Updated last year
- Cross language information retrieval pipeline☆18Updated last year
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆76Updated last year
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆32Updated last year
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Updated last year
- Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020)☆56Updated 2 years ago
- ☆66Updated 2 years ago