JohnGiorgi / DeCLUTR
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
☆379Updated last year
Related projects ⓘ
Alternatives and complementary repositories for DeCLUTR
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆526Updated 3 years ago
- ☆344Updated 3 years ago
- Autoregressive Entity Retrieval☆765Updated last year
- docTTTTTquery document expansion model☆356Updated last year
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆363Updated 2 years ago
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆340Updated 11 months ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆325Updated 10 months ago
- A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks☆363Updated last year
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch☆261Updated last year
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"☆291Updated 2 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆201Updated 2 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆605Updated 2 years ago
- Search Engines with Autoregressive Language models☆277Updated last year
- KnowBert -- Knowledge Enhanced Contextual Word Representations☆373Updated 4 years ago
- BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision☆291Updated 3 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆243Updated 2 years ago
- ☆120Updated 4 years ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆197Updated last year
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆187Updated 3 years ago
- ☆294Updated last year
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation☆108Updated 3 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆309Updated last year
- Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"☆198Updated 3 years ago
- Interpretable Evaluation for AI Systems☆361Updated last year
- Library for Knowledge Intensive Language Tasks☆916Updated 2 years ago
- IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models☆177Updated 3 years ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆153Updated 2 years ago
- TextAugment: Text Augmentation Library☆402Updated 9 months ago
- ☆214Updated last year
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆202Updated 3 years ago