SUSTech-IDEA / SUS-Chat
SUS-Chat: Instruction tuning done right
☆47 Updated 9 months ago
Related projects
Alternatives and complementary repositories for SUS-Chat
- XVERSE-65B: A multilingual large language model developed by XVERSE Technology Inc. ☆132 Updated 7 months ago
- Imitate OpenAI with Local Models ☆85 Updated 2 months ago
- An intuitive, concrete, and standardized evaluation of current mainstream LLMs ☆92 Updated last year
- Deep learning ☆148 Updated 4 months ago
- A native Chinese benchmark for evaluating retrieval-augmented generation ☆98 Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆121 Updated 10 months ago
- The newest version of Llama 3, with source code explained line by line in Chinese ☆22 Updated 6 months ago
- Mixture-of-Experts (MoE) language model ☆180 Updated 2 months ago
- The official code for "Aurora: Activating Chinese Chat Capability for Mixtral-8x7B Sparse Mixture-of-Experts through Instruction-Tuning" ☆257 Updated 6 months ago
- Demonstrating vLLM's impressive speedups on Chinese large language models ☆31 Updated last year
- An open-source multimodal large language model based on baichuan-7b ☆72 Updated 11 months ago
- Text deduplication ☆67 Updated 5 months ago
- Qwen-7B and Qwen-14B fine-tuning ☆82 Updated 6 months ago
- Code for the Piccolo embedding model from SenseTime ☆106 Updated 5 months ago
- ☆90 Updated 5 months ago
- How to train an LLM tokenizer ☆129 Updated last year
- ☆105 Updated last year
- ☆129 Updated 4 months ago
- Lightweight local website for displaying the performance of different chat models. ☆85 Updated 11 months ago
- ☆213 Updated 5 months ago
- A personal reimplementation of Google's Infini-Transformer using a small 2B model. The project includes both model and train… ☆52 Updated 6 months ago
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and … ☆18 Updated 6 months ago
- Training an LLM from zero: hyperparameter tuning ☆30 Updated last year
- Another ChatGLM2 implementation for GPTQ quantization ☆54 Updated last year
- ☆72 Updated 10 months ago
- Code for "Scaling Laws of RoPE-based Extrapolation" ☆70 Updated last year
- An instruction-tuning toolkit for large language models (with FlashAttention support) ☆166 Updated 10 months ago
- ☆125 Updated last year
- Just for debugging ☆56 Updated 8 months ago
- A more efficient GLM implementation! ☆55 Updated last year