An unofficial implementation of Poly-encoder (Poly-encoders: Transformer Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring)
☆249Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for PolyEncoder
Users that are interested in PolyEncoder are comparing it to the libraries listed below
Sorting:
- ☆167Apr 19, 2023Updated 2 years ago
- Poly-encoder architecture and pre-training pipeline implementation (pytorch)☆16Jun 29, 2020Updated 5 years ago
- 24*2个预训练的小型BERT模型,NLPer炼丹利器☆51Apr 12, 2020Updated 5 years ago
- TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020)☆535May 19, 2021Updated 4 years ago
- ☆279Dec 8, 2020Updated 5 years ago
- Fine-grained Post-training for Improving Retrieval-based Dialogue Systems - NAACL 2021☆95Jul 8, 2021Updated 4 years ago
- Consistent dialogue generation☆16Oct 26, 2022Updated 3 years ago
- ☆25May 4, 2022Updated 3 years ago
- ☆63Jan 2, 2020Updated 6 years ago
- The score code of FastBERT (ACL2020)☆609Oct 29, 2021Updated 4 years ago
- PyTorch Implementation for AAAI'21 "Do Response Selection Models Really Know What's Next? Utterance Manipulation Strategies for Multi-tur…☆51Jan 16, 2021Updated 5 years ago
- DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.☆325May 9, 2021Updated 4 years ago
- Dense Passage Retriever - is a set of tools and models for open domain Q&A task.☆1,863Apr 6, 2023Updated 2 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering☆121May 22, 2023Updated 2 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,643Oct 16, 2024Updated last year
- Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer☆542Dec 10, 2021Updated 4 years ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,799Oct 14, 2025Updated 5 months ago
- Neural Text Generation with Unlikelihood Training☆310Aug 31, 2021Updated 4 years ago
- 🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…☆16Oct 7, 2024Updated last year
- a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.☆1,542Jul 18, 2025Updated 8 months ago
- ☆448Oct 26, 2022Updated 3 years ago
- Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.☆3,155Jan 22, 2024Updated 2 years ago
- ☆443Jul 1, 2022Updated 3 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,984Nov 21, 2022Updated 3 years ago
- The source code used for paper "Empower Entity Set Expansion via Language Model Probing", published in ACL 2020.☆33Nov 3, 2020Updated 5 years ago
- State-of-the-Art Text Embeddings☆18,390Mar 12, 2026Updated last week
- ☆90Jun 20, 2020Updated 5 years ago
- Arabic edition of ALBERT pretrained language models☆16Apr 25, 2021Updated 4 years ago
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"☆345Nov 11, 2024Updated last year
- Multi-stage passage ranking: monoBERT + duoBERT☆110Nov 23, 2020Updated 5 years ago
- Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"☆1,417Jan 10, 2024Updated 2 years ago
- The sources codes of the DR-BERT model and baselines☆38Nov 17, 2021Updated 4 years ago
- a bert for retrieval and generation☆859Feb 26, 2021Updated 5 years ago
- Facilitating the design, comparison and sharing of deep text matching models.☆3,855Aug 2, 2024Updated last year
- ☆22Oct 14, 2021Updated 4 years ago
- ☆124Feb 3, 2019Updated 7 years ago
- this repository contains the dataset and the source code for the EMNLP 2019 paper "A Neural Citation Count Prediction Model based on Peer…☆10Oct 8, 2021Updated 4 years ago
- Grounded conversational dataset for end-to-end conversational AI (official DSTC7 data)☆175Aug 20, 2024Updated last year
- 简单的向量白化改善句向量质量☆487Jun 17, 2021Updated 4 years ago