tomoharr / BertAdaptationMLM-NSP
Bert domain adaptation with target corpus by pretraining tasks (Masked Language Model & Next Sentence Prediction). These codes uses huggingface 🤗transformers.
☆9Updated 5 years ago
Alternatives and similar repositories for BertAdaptationMLM-NSP:
Users that are interested in BertAdaptationMLM-NSP are comparing it to the libraries listed below
- Code and models for the paper titled "Better Feature Integration for Named Entity Recognition", NAACL 2021.☆30Updated 3 years ago
- The source code of paper "PAIR-LEVEL SUPERVISED CONTRASTIVE LEARNING FOR NATURAL LANGUAGE INFERENCE" at ICASSP 2022.☆48Updated last year
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Updated 2 years ago
- pytorch版simcse无监督语义相似模型☆22Updated 3 years ago
- The 4th rank system of the SemEval 2021 Task4.☆10Updated 2 years ago
- huggingface ChineseBert Tokenizer☆15Updated 2 years ago
- Code for the paper "Partially-Aligned Data-to-Text Generation with Distant Supervision" in EMNLP 2020.☆19Updated 4 years ago
- A concise implementation of SimCSE☆17Updated 3 years ago
- Source code for the ACL-IJCNLP 2021 paper entitled "T-DNA: Taming Pre-trained Language Models with N-gram Representations for Low-Resourc…☆19Updated 2 years ago
- a script from ERNIE1.0 or ERNIE2.0 to transfomers' BERT format☆10Updated 5 years ago
- ☆21Updated 3 years ago
- ☆46Updated 3 years ago
- 基于回译增强数据,目前整合了百度、有道、谷歌(需翻墙)翻译。☆20Updated 4 years ago
- ☆10Updated 3 years ago
- This is my study notes for my PhD in AI, NLP, IR, and more.☆17Updated 3 years ago
- ☆13Updated 3 years ago
- Code for our AAAI2021 paper: Token-Aware Virtual Adversarial Training For Language Understanding.☆25Updated 4 years ago
- 蚂蚁金融自然语言处理竞赛。☆9Updated 6 years ago
- ☆15Updated 2 years ago
- Prompt-learning methods used BERT4Keras (PET, EFL and NSP-BERT), both for Chinese and English.☆29Updated 2 years ago
- ☆18Updated 3 years ago
- SNCSE: Contrastive Learning for Unsupervised Sentence Embedding with Soft Negative Samples☆75Updated 2 years ago
- CCL2020,“小牛杯”幽默计算任务数据发布☆22Updated 7 months ago
- ☆12Updated 5 years ago
- The dataset and PyTorch Implementation for ACL 2020 paper "MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Ans…☆44Updated 4 years ago
- 基于中文TaCL-BERT的中文命名实体识别及中文分词☆33Updated 3 years ago
- 无监督文本生成的一些方法☆48Updated 3 years ago
- K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (Findings of EMNLP …☆31Updated 2 years ago
- ALIGNIE: Few-Shot Fine-Grained Entity Typing with Automatic Label Interpretation and Instance Generation☆20Updated 2 years ago
- 目前只有阅读理解赛道的☆14Updated 4 years ago