microsoft / SDR
Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference
β45Updated 2 years ago
Alternatives and similar repositories for SDR:
Users that are interested in SDR are comparing it to the libraries listed below
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paperβ52Updated last year
- exBERT on Transformersπ€β10Updated 3 years ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Costβ40Updated last year
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answeringβ16Updated 2 years ago
- Code for text augmentation method leveraging large-scale language modelsβ60Updated 3 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)β76Updated last year
- [NAACL 2021] Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answeringβ36Updated 3 years ago
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"β11Updated 2 years ago
- Using business-level retrieval system (BM25) with Python in just a few lines.β31Updated last year
- Abstractive summarization using Bert2Bert framework.β31Updated 4 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasksβ63Updated 3 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transformβ17Updated 4 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"β26Updated 3 years ago
- Few-shot learning framework for opinion summarization published at EMNLP 2020.β35Updated 3 years ago
- β28Updated 2 years ago
- Test code of Inverse cloze task for information retrievalβ33Updated 4 years ago
- Don't Judge a Language Model by Its Last Layer: Contrastive Learning with Layer-Wise Attention Poolingβ9Updated 2 years ago
- β55Updated 2 years ago
- β34Updated last year
- Official repository for "Reweighting Strategy based on Synthetic Data Identification for Sentence Similarity (COLING2022)"β18Updated 2 years ago
- Official Pytorch implementation of (Roles and Utilization of Attention Heads in Transformer-based Neural Language Models), ACL 2020β16Updated 2 years ago
- KETOD Knowledge-Enriched Task-Oriented Dialogueβ32Updated 2 years ago
- β67Updated 3 years ago
- AVocaDo : Strategy for Adapting Vocabulary to Downstream Domainβ23Updated 2 years ago
- β38Updated 2 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021β29Updated last year
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Treesβ23Updated last year
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answeringβ38Updated 3 years ago
- Knowledge Infused Decodingβ71Updated last year
- Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020)β56Updated 2 years ago