stanleylsx / text_embedding
A tool for training sentence embeddings, with support for CoSENT, SimCSE, and InfoNCE
☆18 Updated last week
Alternatives and similar repositories for text_embedding
Users who are interested in text_embedding are comparing it to the libraries listed below
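- easy-bert is a Chinese NLP tool that provides calling and tuning methods for many BERT variants and is quick to get started with; its clear design and code comments also make it well suited for learning. ☆79 Updated 2 years ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. A named entity recognition tool that supports models such as BertSoftmax and BertSpan, ready to use out of the box. ☆114 Updated last year
- Uses torch to integrate two classic pointer-based NER extraction paradigms, SpanBert and Su Jianlin's GlobalPointer, with a few simple tricks added; runs with one command once configured. ☆133 Updated last year
- Hands-on information extraction with llama ☆100 Updated 2 years ago
- Text similarity, semantic vectors, text vectors, text-similarity, similarity, sentence-similarity, BERT, SimCSE, BERT-Whitening, Sentence-BERT, PromCSE, SBERT ☆74 Updated 7 months ago
- A PyTorch-based text classification framework supporting TextCNN, Bert, Electra, and others. ☆62 Updated 2 years ago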
- [Unofficial] Prediction code for the AAAI 2022 paper: Unified Named Entity Recognition as Word-Word Relation Classification ☆53 Updated 2 years ago
- Train and evaluate your own text similarity dataset with sentence-transformers (SBert). ☆48 Updated 3 years ago
- Named entity recognition with Baidu UIE, based on PyTorch. ☆56 Updated 2 years ago
- deep training task ☆29 Updated 2 years ago
- A Chinese NER model based on lexicon information fusion ☆169 Updated 3 years ago
- Knowledge Graph ☆172 Updated 2 years ago
- An optimized version of GlobalPointer / NER entity recognition ☆120 Updated 3 years ago
- benchmark of KgCLUE, with different models and methods ☆27 Updated 3 years ago
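- A Chinese text error correction model, implemented in Keras ☆74 Updated 3 years ago
- A Chinese annotation tool supporting NER, text classification, relation annotation, dialogue annotation, and more. ☆76 Updated 10 months ago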
- CoSENT, STS, SentenceBERT ☆168 Updated 4 months ago
- ChatGLM-6B fine-tuning. ☆135 Updated 2 years ago
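- A pre-training-based sentence vector generation tool ☆137 Updated 2 years ago
- Datasets collected during my ongoing projects, including both raw and processed data; continuously updated. ☆114 Updated 4 years ago
- The largest open-source Chinese Q&A dataset, supporting Chinese LLMs ☆10 Updated last year
- Unsupervised Chinese keyphrase extraction: statistics-based, graph-based (LDA and PageRank: TextRank, TPR, Salience Rank, Single TPR, etc.), and embedding-based (SIFRank, etc.); ready to use out of the box! ☆105 Updated 3 years ago
- A retrieval-based dialogue system solution built on vector recall, dense retrieval, FAQ… ☆33 Updated 3 years ago
- A dataset of 3,000,000+ samples for semantic understanding and matching. Can be used for unsupervised contrastive learning, semi-supervised learning, and similar methods to build the best-performing pre-trained models for the Chinese domain ☆299 Updated 2 years ago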
- baichuan LLM supervised fine-tuning with LoRA ☆63 Updated 2 years ago
- SinglepassTextCluster, a text clustering tool based on the Singlepass clustering algorithm that uses tfidf vectors and doc2vec, which can be used for… ☆63 Updated 3 years ago
- Pre-trains a Bert model with the Mask LM pre-training task. Trains model representations on vertical-domain corpora to improve downstream task performance. ☆45 Updated 2 years ago
- Minimal keyword extraction with BERT ☆84 Updated 3 years ago
- Baseline for the Intelligent Text Proofreading Competition (Chinese Text Correction) ☆67 Updated 2 years ago
- ChatGLM2-6B fine-tuning, SFT/LoRA, instruction finetune ☆108 Updated last year