yaushian / mSimCSE
mSimCSE: Multilingual SimCSE
☆33Updated last year
Related projects ⓘ
Alternatives and complementary repositories for mSimCSE
- ☆57Updated last year
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆73Updated 2 years ago
- Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling"☆33Updated 2 years ago
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆133Updated 5 months ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆92Updated 2 years ago
- source code for paper: WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach.☆56Updated 3 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆72Updated 2 years ago
- An easy-to-use tool for phrase encoding and topic mining (unsupervised aspect extraction); Code base for ACL 2022 paper, UCTopic: Unsuper…☆43Updated last year
- ☆11Updated last year
- Source codes for the paper "Local Additivity Based Data Augmentation for Semi-supervised NER"☆44Updated 2 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆93Updated last year
- ☆36Updated 2 years ago
- Long-context pretrained encoder-decoder models☆95Updated 2 years ago
- ☆92Updated 3 years ago
- [NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining☆118Updated last year
- Dense hybrid representations for text retrieval☆61Updated last year
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆30Updated 4 years ago
- ☆54Updated last year
- ☆95Updated 2 years ago
- This is the code for the EMNLP2020 Finding paper "BERT for Monolingual and Cross-Lingual Reverse Dictionary"☆19Updated 4 years ago
- ConTextual Mask Auto-Encoder for Dense Passage Retrieval☆35Updated this week
- Supervised Contrastive Learning for Downstream Optimized Sequence Representations☆26Updated 3 years ago
- ☆78Updated 2 years ago
- Pre-training BART in Flax on The Pile dataset☆20Updated 3 years ago
- PyTorch reimplementation of REALM and ORQA☆22Updated 2 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Updated 3 years ago
- Code for our EMNLP 2020 Paper "AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network"☆18Updated last year
- ☆34Updated last year
- An Unsupervised Sentence Embedding Method by Mutual Information Maximization (EMNLP2020)☆60Updated 3 years ago
- [EMNLP'21] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels.☆75Updated 2 years ago