shibing624 / similarities
Similarities: a toolkit for similarity calculation and semantic search. 相似度计算、匹配搜索工具包,支持亿级数据文搜文、文搜图、图搜图,python3开发,开箱即用。
☆834Updated 4 months ago
Alternatives and similar repositories for similarities:
Users that are interested in similarities are comparing it to the libraries listed below
- TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLO…☆953Updated 6 months ago
- unified embedding model☆851Updated last year
- 中文CLIP预训练模型☆402Updated 2 years ago
- 多模态中文LLaMA&Alpaca大语言模型(VisualCLA)☆441Updated last year
- PromptCLUE, 全中文任务支持零样本学习模型☆660Updated last year
- text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。☆4,643Updated 2 months ago
- PaddleNLP UIE模型的PyTorch版实现☆623Updated last year
- 中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。☆1,457Updated 10 months ago
- 一个简单快速的分词、命名实体 识别工具☆570Updated 8 months ago
- The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set defaul ope…☆808Updated 9 months ago
- ChatGLM-6B 指令学习|指令数据|Instruct☆655Updated last year
- LexiLaw - 中文法律大模型☆814Updated last week
- 雅意信息抽取大模型:在百万级人工构造的高质量信息抽取数据上进行指令微调,由中科闻歌算法团队研发。 (Repo for YAYI Unified Information Extraction Model)☆292Updated 7 months ago
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆987Updated 10 months ago
- Chat-甄嬛是利用《甄嬛传》剧本中所有关于甄嬛的台词和语句,基于ChatGLM2进行LoRA微调得到的模仿甄嬛语气的聊天语言模型。☆621Updated 6 months ago
- MuCGEC中文纠错数据集及文本纠错SOTA模型开源;Code & Data for our NAACL 2022 Paper "MuCGEC: a Multi-Reference Multi-Source Evaluation Dataset for Chinese Gr…☆526Updated last year
- 开源SFT数据集整理,随时补充☆498Updated last year
- chatglm多gpu用deepspeed和☆405Updated 8 months ago
- MiniRBT (中文小型预训练模型系列)☆266Updated last year
- 人工精调的中文对话数据集和一段chatglm的微调代码☆1,174Updated 10 months ago
- An Open-sourced Knowledgable Large Language Model Framework.☆1,286Updated 2 months ago
- ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SF…☆2,284Updated last year
- 基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等☆2,723Updated last year
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,674Updated last month
- 使用peft库,对chatGLM-6B/chatGLM2-6B实现4bit的QLoRA高效微调,并做lora model和base model的merge及4bit的量化(quantize)。☆358Updated last year
- 中文自然语言推理与语义相似度数据集☆345Updated 3 years ago
- Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型☆407Updated last year
- 夫子•明察司法大模型是由山东大学、浪潮云、中国政法大学联合研发,以 ChatGLM 为大模型底座,基于海量中文无监督司法语料与有监督司法微调数据训练的中文司法大模型。该模型支持法条检索、案例分析、三段论推理判决以及司法对话等功能,旨在为用户提供全方位、高精准的法律咨询与解答…☆314Updated 4 months ago
- 🚀WebUI integrated platform for latest LLMs | 各大语言模型的全流程工具 WebUI 整合包。支持主流大模型API接口和开源模型。支持知识库,数据库,角色扮演,mj文生图,LoRA和全参数微调,数据集制作,live2d等全流程应用…☆524Updated 3 months ago
- 比Sentence-BERT更有效的句向量方案☆366Updated 2 years ago