stanleylsx / text_embeddingLinks
一个用于训练句子embedding的工具,支持Cosent以及Simcse、infonce
☆20Updated last month
Alternatives and similar repositories for text_embedding
Users that are interested in text_embedding are comparing it to the libraries listed below
Sorting:
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆115Updated last year
- deep training task☆29Updated 2 years ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆108Updated 2 years ago
- easy-bert是一个中文NLP工具,提供诸多bert变体调用和调参方法,极速上手;清晰的设计和代码注释,也很适合学习☆79Updated 2 years ago
- (1)弹性区间标准化的旋转位置词嵌入编码器+peft LORA量化训练,提高万级tokens性能支持。(2)证据理论解释学习,提升模型的复杂逻辑推理能力(3)兼容alpaca数据格式。☆44Updated 2 years ago
- SMP 2023 ChatGLM金融大模型挑战赛 60 分baseline思路介绍☆185Updated last year
- llama信息抽取实战☆100Updated 2 years ago
- 基于pytorch的中文意图识别和槽位填充☆184Updated last year
- 基于pytorch的百度UIE命名实体识别。☆56Updated 2 years ago
- 支持ChatGLM2 lora微调☆40Updated 2 years ago
- 文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT☆74Updated 7 months ago
- 一个基于预训练的句向量生成工具☆137Updated 2 years ago
- 通用版面分析 | 中文文档解析 |Document Layout Analysis | layout paser☆46Updated last year
- ChatGLM-6B fine-tuning.☆135Updated 2 years ago
- 使用torch整合两种经典的指针NER抽取范式,分别是SpanBert和苏神的GlobalPointer,简单加了些tricks,配置后一键运行☆133Updated last year
- BLOOM 模型的指令微调☆24Updated 2 years ago
- 基于向量召回的检索式对话系统解决方案,dense retrieval,FAQ……☆33Updated 3 years ago
- basic framework for rag(retrieval augment generation)☆85Updated last year
- 使用sentence-transformers(SBert)训练自己的文本相似度数据集并进行评估。☆48Updated 3 years ago
- sentence-transformers to onnx 让sbert模型推理效率更快☆164Updated 3 years ago
- experiments of some semantic matching models and comparison of experimental results.☆162Updated 2 years ago
- SinglepassTextCluster, an TextCluster tools based on Singlepass cluster algorithm that use tfidf vector and doc2vec,which can be used for…☆63Updated 3 years ago
- baichuan LLM surpervised finetune by lora☆63Updated 2 years ago
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆87Updated 2 years ago
- GoGPT:基于Llama/Llama 2训练的中英文增强大模型|Chinese-Llama2☆78Updated last year
- 中文文本纠错模型,keras实现☆74Updated 4 years ago
- 中文标注工具,支持NER、文本分类、关系标注、对话标注等。☆79Updated 11 months ago
- 最大开源中文问答数据集 ,助力中文LLM.The largest open-source Chinese Q&A dataset, supporting Chinese LLM☆10Updated last year
- Minimal keyword extraction with BERT☆85Updated 3 years ago
- 基于simcse的中文句向量生成☆15Updated 3 years ago