wangyuxinwhy / uniem
unified embedding model
☆853 Updated last year
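As a rough illustration of what uniem is for, the sketch below embeds a few sentences with an M3E checkpoint (the embedding model family released by the uniem project) loaded through sentence-transformers. The checkpoint name, library choice, and embedding dimension are assumptions for illustration, not details taken from this page.

```python
# Minimal sketch (assumption: moka-ai/m3e-base, an embedding model trained with
# uniem, loaded via the sentence-transformers library).
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("moka-ai/m3e-base")  # hypothetical checkpoint choice
sentences = ["uniem trains unified text embedding models", "你好，世界"]
# Encode to dense vectors; L2-normalize so cosine similarity is a dot product.
embeddings = model.encode(sentences, normalize_embeddings=True)
print(embeddings.shape)  # e.g. (2, 768) for a base-sized model
```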
Alternatives and similar repositories for uniem:
Users who are interested in uniem are comparing it to the libraries listed below.
- ChatGLM-6B instruction learning | instruction data | Instruct ☆654 Updated 2 years ago
- A manually curated Chinese dialogue dataset and fine-tuning code for ChatGLM ☆1,178 Updated 11 months ago
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning. ☆994 Updated 11 months ago
- Efficient 4-bit QLoRA fine-tuning of chatGLM-6B/chatGLM2-6B with the peft library, including merging the LoRA model into the base model and 4-bit quantization. ☆359 Updated last year
- Multi-GPU chatglm with DeepSpeed and … ☆409 Updated 9 months ago
- Firefly Chinese LLaMA-2 large model, supporting continued pretraining of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other large models ☆410 Updated last year
- Full-parameter fine-tuning of ChatGLM2-6B, with efficient fine-tuning for multi-turn dialogue. ☆398 Updated last year
- A curated collection of open-source SFT datasets, continually updated ☆507 Updated last year
- TextGen: Implementation of text generation models, including LLaMA, ChatGLM, BLOOM, GPT2, BART, T5, SongNet, and more. ☆962 Updated 7 months ago
- Exploring the fine-tuning performance of Chinese instruct data on ChatGLM and LLaMA ☆390 Updated 2 years ago
- pCLUE: a multi-task prompt-learning dataset with 1,000,000+ examples ☆490 Updated 2 years ago
- CMMLU: Measuring massive multitask language understanding in Chinese ☆752 Updated 4 months ago
- ChatGLM-6B fine-tuning and Alpaca fine-tuning ☆1,542 Updated last month
- ☆322 Updated 10 months ago
- Analysis of the Chinese cognitive abilities of language models ☆236 Updated last year
- PromptCLUE: a zero-shot learning model supporting all Chinese tasks ☆663 Updated last year
- An Open-sourced Knowledgeable Large Language Model Framework. ☆1,304 Updated 3 months ago
- This project collects open-source datasets for table-intelligence tasks (e.g., table question answering and table-to-text generation), converts the raw data into instruction-tuning format, and fine-tunes LLMs on it to strengthen their understanding of tabular data, ultimately building a large language model dedicated to table-intelligence tasks. ☆563 Updated last year
- Cornucopia (聚宝盆): a series of open-source, commercially usable Chinese financial LLMs, together with an efficient, lightweight training framework for vertical-domain LLMs (pretraining, SFT, RLHF, quantization, etc.) ☆626 Updated last year
- The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set default ope… ☆815 Updated 10 months ago
- Benchmarking Chinese text-embedding quality of open-source embedding models ☆139 Updated last year
- SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding ☆223 Updated last year
- Multimodal Chinese LLaMA & Alpaca large language model (VisualCLA) ☆443 Updated last year
- YAYI information extraction LLM: instruction-tuned on millions of manually constructed, high-quality information extraction examples, developed by the 中科闻歌 algorithm team. (Repo for YAYI Unified Information Extraction Model) ☆300 Updated 8 months ago
- Luotuo Embedding (骆驼嵌入) is a text embedding model developed by 李鲁鲁, 冷子昂, 陈启源, and others. ☆266 Updated last year
- Alpaca Chinese instruction fine-tuning dataset ☆392 Updated 2 years ago
- Luotuo (骆驼): a Chinese instruction-finetuned LLaMA. Developed by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子昂 @ 商汤科技 ☆718 Updated last year
- Easy and efficient fine-tuning of LLMs (supports LLama, LLama2, LLama3, Qwen, Baichuan, GLM, Falcon). Efficient quantized training + deployment for large models. ☆600 Updated 3 months ago
- A fine-tuning dataset generation tool designed for ChatGLM; come make your own catgirl. ☆605 Updated last year
- Repo for adapting Meta LlaMA2 in Chinese! A Chinese adaptation of Meta's newly released LlaMA2! (fully open-source and commercially usable) ☆744 Updated last year