zoranmedic / mdcr
Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientific fields
☆10 · Updated 2 years ago
Alternatives and similar repositories for mdcr
Users interested in mdcr are comparing it to the repositories listed below:
- A Test Collection of Computer Science Papers for Faceted Query by Example ☆21 · Updated 3 years ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding" ☆14 · Updated 2 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models" ☆14 · Updated 2 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training. ☆27 · Updated last year
- Code for ECIR 2022 paper Local Citation Recommendation with Hierarchical-Attention Text Encoder and SciBERT-based Reranking ☆25 · Updated 6 months ago
- Submission archive for the MS MARCO passage ranking leaderboard ☆13 · Updated last year
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie… ☆49 · Updated 2 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here. ☆27 · Updated last year
- MultiCite code and data. Models are available on Huggingface. ☆29 · Updated 2 years ago
- Cross language information retrieval pipeline ☆18 · Updated last year
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers. ☆51 · Updated last year
- Knowledge graph based information retrieval ☆13 · Updated 6 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper) ☆66 · Updated 2 years ago
- CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing (ACL 2022) ☆9 · Updated 2 years ago
- X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents (JCDL 2022) ☆14 · Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆43 · Updated 5 months ago
- Dataset, models, and code for paper "CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation", … ☆33 · Updated 2 years ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval ☆19 · Updated 2 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models" ☆26 · Updated 3 years ago
- CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022 ☆16 · Updated 2 months ago
- PyTorch code for "FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization" (NAACL 2022) ☆38 · Updated 2 years ago
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022 ☆13 · Updated 2 years ago
- Entity Linking & discovery solution. Agarwal et al., "Entity Linking via Explicit Mention-Mention Coreference Modeling", NAACL 2022. ☆26 · Updated 9 months ago