☆80Jul 11, 2022Updated 3 years ago
Alternatives and similar repositories for sentence-representations
Users that are interested in sentence-representations are comparing it to the libraries listed below
- ☆105May 30, 2023Updated 2 years ago
- [EMNLP 2021] Efficient Contrastive Learning via Novel Data Augmentation and Curriculum Learning☆17Jun 28, 2025Updated 8 months ago
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"☆297Oct 27, 2022Updated 3 years ago
- Code of ACL 2022 paper Debiased Contrastive Learning of Unsupervised Sentence Representations☆32Mar 16, 2022Updated 3 years ago
- Official implementation of EMNLP 2022 paper "Generate, Discriminate, and Contrast: A Semi-Supervised Sentence Representation Learning Fram…☆23Nov 27, 2022Updated 3 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆94Jun 8, 2022Updated 3 years ago
- [NeurIPS 2022] Non-Linguistic Supervision for Contrastive Learning of Sentence Embeddings☆22Jan 30, 2023Updated 3 years ago
- An easy-to-use tool for phrase encoding and topic mining (unsupervised aspect extraction); Code base for ACL 2022 paper, UCTopic: Unsuper…☆48Apr 25, 2023Updated 2 years ago
- ☆35May 18, 2023Updated 2 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆378Apr 21, 2023Updated 2 years ago
- [EMNLP 2022] Differentiable Data Augmentation for Contrastive Sentence Representation Learning. https://arxiv.org/abs/2210.16536☆40Nov 1, 2022Updated 3 years ago
- PyTorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021☆306Oct 23, 2023Updated 2 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- ☆12Feb 14, 2023Updated 3 years ago
- ☆43Jun 12, 2023Updated 2 years ago
- Poly-encoder architecture and pre-training pipeline implementation (pytorch)☆15Jun 29, 2020Updated 5 years ago
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆133Aug 6, 2025Updated 6 months ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,641Oct 16, 2024Updated last year
- PromptBERT: Improving BERT Sentence Embeddings with Prompts☆343Nov 22, 2023Updated 2 years ago
- Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling"☆33Oct 23, 2022Updated 3 years ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆42Nov 15, 2023Updated 2 years ago
- Implementation of SimCSE+ESimCSE on Chinese datasets☆191May 21, 2022Updated 3 years ago
- ACL 2022(findings): A Sentence is Worth 128 Pseudo Tokens: A Semantic-Aware Contrastive Learning Framework for Sentence Embeddings☆18Mar 23, 2022Updated 3 years ago
- DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings☆19Nov 24, 2021Updated 4 years ago
- [COLING'22] Code for our paper: "COLO: A Contrastive Learning based Re-ranking Framework for One-Stage Summarization"☆22Oct 21, 2022Updated 3 years ago
- Code for AAAI 2022 paper Unsupervised Sentence Representation via Contrastive Learning with Mixing Negatives☆23Jun 14, 2022Updated 3 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆76Dec 29, 2025Updated 2 months ago
- The code for our paper "NSP-BERT: A Prompt-based Zero-Shot Learner Through an Original Pre-training Task —— Next Sentence Prediction"☆230Oct 12, 2022Updated 3 years ago
- ☆21Jul 12, 2025Updated 7 months ago
- ☆10Jun 5, 2025Updated 8 months ago
- Repository for the code associated with the paper: Unsupervised Extractive Summarization using Mutual Information☆25Sep 11, 2021Updated 4 years ago
- ☆24Oct 8, 2024Updated last year
- This is the PyTorch code + data repository for paper "Hierarchical Multi-head Attentive Network for Evidence-aware Fake News Detection", …☆42Feb 19, 2022Updated 4 years ago
- Data and Baselines for AStitchInLanguageModels dataset☆12Oct 31, 2022Updated 3 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆54Nov 21, 2022Updated 3 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆76Sep 18, 2022Updated 3 years ago
- Code on IART: Intent-aware Response Ranking with Transformers in Information-seeking Conversation Systems (WWW 2020)☆11Apr 18, 2021Updated 4 years ago
- Python package implementing the greedy string tiling algorithm for comparing string similarity☆12Mar 20, 2023Updated 2 years ago
- A Deep Generative Distance-Based Classifier for Out-of-Domain Detection with Mahalanobis Space☆12Jun 21, 2021Updated 4 years ago