neavo / KeywordGachaModel
☆16Updated 11 months ago
Alternatives and similar repositories for KeywordGachaModel
Users who are interested in KeywordGachaModel are comparing it to the repositories listed below
- Imitate OpenAI with Local Models☆89Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆139Updated last year
- Chinese pre-trained ModernBert☆95Updated 8 months ago
- A fluent, scalable, and easy-to-use LLM data processing framework.☆26Updated this week
- SUS-Chat: Instruction tuning done right☆49Updated last year
- This repository provides an implementation of "A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction B…☆85Updated 6 months ago
- Another ChatGLM2 implementation for GPTQ quantization☆54Updated 2 years ago
- ☆174Updated last year
- Generate multi-round conversation roleplay data based on self-instruct and evol-instruct.☆136Updated last year
- ☆235Updated last year
- This project is established for real-time training of the RWKV model.☆50Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and …☆18Updated last year
- It's an open-source LLM based on an MoE structure.☆58Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆132Updated last year
- Text deduplication☆77Updated last year
- A plan to extend ChatHaruhi into a zero-shot roleplaying model☆115Updated last year
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc.☆141Updated last year
- ☆11Updated last year
- Reinforcement Learning Toolkit for RWKV (v6, v7, ARWKV): distillation, SFT, RLHF (DPO, ORPO), infinite context training, aligning. Exploring the…☆59Updated 3 months ago
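- This project aims to perform fast encoding detection and conversion on large numbers of text files, to support data cleaning for the mnbvc corpus project☆68Updated 2 months ago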
- Mixture-of-Experts (MoE) Language Model☆192Updated last year
- Deep Reasoning Translation (DRT) Project☆240Updated 4 months ago
- ☆164Updated last week
- deep learning☆149Updated 8 months ago
- Instruction-tuning tool for large language models (supports FlashAttention)☆178Updated 2 years ago
- GLM Series Edge Models☆156Updated 6 months ago
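- (1) Rotary position embedding encoder with elastic interval normalization + peft LoRA quantized training, improving support for contexts of tens of thousands of tokens. (2) Evidence-theory explanation learning, improving the model's complex logical reasoning ability. (3) Compatible with the alpaca data format.☆45Updated 2 years ago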
- The first Chinese llama2 13b model (Base + Chinese dialogue SFT, enabling fluent multi-turn human-machine natural language interaction)☆91Updated 2 years ago
- The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆266Updated last year
- xllamacpp - a Python wrapper of llama.cpp☆68Updated last week