zejunwang1 / bert4vec
一个基于预训练的句向量生成工具
☆134Updated last year
Alternatives and similar repositories for bert4vec:
Users that are interested in bert4vec are comparing it to the libraries listed below
- experiments of some semantic matching models and comparison of experimental results.☆161Updated last year
- CoSENT、STS、SentenceBERT☆163Updated last week
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆105Updated last year
- 3000000+语义理解与匹配数据集。可用于无监督对比学习、半监督学习等构建中文领域效果最好的预训练模型☆291Updated 2 years ago
- SimBERT升级版(SimBERTv2)!☆441Updated 2 years ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆115Updated last year
- 中文自然语言推理与语义相似度数据集☆345Updated 3 years ago
- 真 · “Deep Learning for Humans”☆141Updated 3 years ago
- ChatGLM-6B fine-tuning.☆135Updated last year
- llama信息抽取实战☆98Updated last year
- text embedding☆144Updated last year
- Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021☆234Updated 2 years ago
- Minimal keyword extraction with BERT☆79Updated 3 years ago
- NLP句子编码、句子embedding、语义相似度:BERT_avg、BERT_whitening、SBERT、SmiCSE☆175Updated 3 years ago
- RoFormer升级版☆152Updated 2 years ago
- 无监督中文关键词抽取(Keyphrase Extraction),基于统计,基于图【LDA与PageRank(TextRank, TPR, Salience Rank, Single TPR等)】,基于嵌入【SIFRank等】,开箱即用!☆105Updated 2 years ago
- A framework for cleaning Chinese dialog data☆265Updated 3 years ago
- 评估自然语言的流畅度☆111Updated 3 years ago
- 中文无监督SimCSE Pytorch实现☆133Updated 3 years ago
- 句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。☆97Updated 2 years ago
- 文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT☆73Updated 2 months ago
- 文本智能校对大赛(Chinese Text Correction)的baseline☆67Updated 2 years ago
- 中文文本纠错模型,keras实现☆70Updated 3 years ago
- 收集了目前为止中文领域的MRC抽取式数据集☆119Updated 8 months ago
- Correcting Chinese Spelling Errors with Phonetic Pre-training 非官方实现☆40Updated 3 years ago
- 比Sentence-BERT更有效的句向量方案☆367Updated 2 years ago
- 继续预训练中文bert☆30Updated 3 years ago
- 时间抽取、解析、标准化工具☆50Updated 2 years ago
- 中文bigbird预训练模型☆91Updated 2 years ago
- SinglepassTextCluster, an TextCluster tools based on Singlepass cluster algorithm that use tfidf vector and doc2vec,which can be used for…☆62Updated 3 years ago