Oneflow-Inc / one-glmLinks

A more efficient GLM implementation!

☆54

Alternatives and similar repositories for one-glm

Users that are interested in one-glm are comparing it to the libraries listed below

Sorting:

seanzhang-zhichen / baichuan-Dynamic-NTK-ALiBi
百川Dynamic NTK-ALiBi的代码实现：无需微调即可推理更长文本
☆49Updated 2 years ago
LydiaXiaohongLi / Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆19Updated 2 years ago
beichao1314 / Open-Llama
The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.
☆67Updated 2 years ago
vxfla / kanchil
Kanchil（鼷鹿）是世界上最小的偶蹄目动物，这个开源项目意在探索小模型（6B以下）是否也能具备和人类偏好对齐的能力。
☆113Updated 2 years ago
genggui001 / Megatron-DeepSpeed-Llama
☆84Updated 2 years ago
keezen / ntk_alibi
NTK scaled version of ALiBi position encoding in Transformer.
☆69Updated 2 years ago
jiahe7ay / infini-mini-transformer
This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…
☆58Updated last year
ProjectD-AI / LLaMA-Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆68Updated 2 years ago
OpenLMLab / scaling-rope
code for Scaling Laws of RoPE-based Extrapolation
☆73Updated 2 years ago
OpenLMLab / ChatZoo
Light local website for displaying performances from different chat models.
☆87Updated last year
K024 / chatglm-q
Another ChatGLM2 implementation for GPTQ quantization
☆53Updated 2 years ago
THUDM / icetk
A unified tokenization tool for Images, Chinese and English.
☆151Updated 2 years ago
OpenBMB / BMCook
Model Compression for Big Models
☆165Updated 2 years ago
alanshi / charset_mnbvc
本项目旨在对大量文本文件进行快速编码检测和转换以辅助mnbvc语料集项目的数据清洗工作
☆65Updated 2 weeks ago
OpenLMLab / MOSS_WebSearchTool
MOSS 003 WebSearchTool: A simple but reliable implementation
☆45Updated 2 years ago
LC1332 / Luotuo-Silk-Road
Silk Road will be the dataset zoo for Luotuo(骆驼). Luotuo is an open sourced Chinese-LLM project founded by 陈启源 @ 华中师范大学 & 李鲁鲁 @ 商汤科技 & 冷子…
☆40Updated last year
Longyichen / Alpaca-family-library
Summarize all open source Large Languages Models and low-cost replication methods for Chatgpt.
☆137Updated 2 years ago
Langboat / mengzi-zero-shot
NLU & NLG (zero-shot) depend on mengzi-t5-base-mt pretrained model
☆76Updated 3 years ago
ssbuild / moss_finetuning
moss chat finetuning
☆51Updated last year
CLUEbenchmark / SuperCLUE-Math6
SuperCLUE-Math6：新一代中文原生多轮多步数学推理数据集的探索之旅
☆60Updated last year
pleisto / yuren-baichuan-7b
基于baichuan-7b的开源多模态大语言模型
☆72Updated last year
CoinCheung / gdGPT
Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.
☆98Updated last year
thaumstrial / FinetuneGLMWithPeft
Simple implementation of using lora form the peft library to fine-tune the chatglm-6b
☆84Updated 2 years ago
ExpressAI / AI-Gaokao
Gaokao Benchmark for AI
☆108Updated 3 years ago
yanqiangmiffy / how-to-train-tokenizer
怎么训练一个LLM分词器
☆153Updated 2 years ago
aplmikex / deduplication_mnbvc
文本去重
☆76Updated last year
thu-coai / OPD
OPD: Chinese Open-Domain Pre-trained Dialogue Model
☆75Updated 2 years ago
ssbuild / deep_training
deep learning
☆148Updated 5 months ago
the-seeds / imitater
Imitate OpenAI with Local Models
☆88Updated last year
ProjectD-AI / llama_inference
llama inference for tencentpretrain
☆99Updated 2 years ago