JohnGiorgi / DeCLUTR
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
☆380 Updated last year
Alternatives and similar repositories for DeCLUTR:
Users who are interested in DeCLUTR are comparing it to the libraries listed below.
- ☆344 Updated 3 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ☆362 Updated 3 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self… ☆201 Updated 2 years ago
- Autoregressive Entity Retrieval ☆782 Updated last year
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated!) ☆327 Updated last year
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings" ☆293 Updated 2 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o… ☆603 Updated 2 years ago
- Interpretable Evaluation for AI Systems ☆363 Updated 2 years ago
- IEEE/ACM TASLP 2020: SBERT-WK: A Sentence Embedding Method By Dissecting BERT-based Word Models ☆178 Updated 4 years ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch ☆262 Updated 2 years ago
- ☆120 Updated 4 years ago
- KnowBert -- Knowledge Enhanced Contextual Word Representations ☆375 Updated 4 years ago
- ☆221 Updated last year
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive… ☆429 Updated last year
- Code associated with the Don't Stop Pretraining ACL 2020 paper ☆529 Updated 3 years ago
- Officially supported AllenNLP models ☆538 Updated 2 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper ☆312 Updated last year
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training ☆154 Updated 2 years ago
- A library to conduct ranking experiments with transformers. ☆161 Updated last year
- This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (K… ☆172 Updated 2 years ago
- BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision ☆291 Updated 3 years ago
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations ☆781 Updated 10 months ago
- An elaborate and exhaustive paper list for Named Entity Recognition (NER) ☆394 Updated 3 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System ☆432 Updated 2 years ago
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini ☆348 Updated last year
- Implementation of Self-adjusting Dice Loss from "Dice Loss for Data-imbalanced NLP Tasks" paper ☆107 Updated 4 years ago
- ☆293 Updated 2 years ago
- Search Engines with Autoregressive Language models ☆283 Updated last year
- docTTTTTquery document expansion model ☆361 Updated last year
- This repository contains the code for "Generating Datasets with Pretrained Language Models". ☆188 Updated 3 years ago