microsoft / SDRLinks
Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference
☆45Updated 3 years ago
Alternatives and similar repositories for SDR
Users that are interested in SDR are comparing it to the libraries listed below
Sorting:
- Code for text augmentation method leveraging large-scale language models☆61Updated 4 years ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.☆48Updated 3 years ago
- exBERT on Transformers🤗☆10Updated 4 years ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Updated 2 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆51Updated 2 years ago
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆32Updated 3 years ago
- Long-context pretrained encoder-decoder models☆96Updated 3 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆76Updated 2 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆62Updated 4 years ago
- FactSumm: Factual Consistency Scorer for Abstractive Summarization☆113Updated 2 years ago
- Knowledge Infused Decoding☆71Updated 2 years ago
- Ensembling Hugging Face transformers made easy☆61Updated 3 years ago
- ☆35Updated 2 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 4 years ago
- ☆30Updated 3 years ago
- "Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings)☆15Updated 2 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆138Updated 2 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Updated 3 years ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Updated 4 years ago
- Calculating Expected Time for training LLM.☆38Updated 2 years ago
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"☆11Updated 3 years ago
- Official Pytorch implementation of (Roles and Utilization of Attention Heads in Transformer-based Neural Language Models), ACL 2020☆16Updated 10 months ago
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering☆17Updated 3 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 3 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform☆17Updated 5 years ago
- Convenient Text-to-Text Training for Transformers☆19Updated 4 years ago
- AVocaDo : Strategy for Adapting Vocabulary to Downstream Domain☆23Updated 3 years ago
- ☆54Updated 3 years ago
- A PyTorch Implementation of the Luna: Linear Unified Nested Attention☆41Updated 4 years ago
- ☆21Updated 4 years ago