sxontheway / Keep-Learning
The record of what I've been through.
☆95 · Updated 3 weeks ago
Alternatives and similar repositories for Keep-Learning:
Users interested in Keep-Learning are comparing it to the repositories listed below:
- ☆52 · Updated last year
- DeepSpeed tutorials & annotated examples & study notes (efficient large-model training)☆149 · Updated last year
- Train LLMs (BLOOM, LLaMA, Baichuan2-7B, ChatGLM3-6B) with DeepSpeed pipeline mode, faster than ZeRO/ZeRO++/FSDP (a pipeline-parallel sketch appears after this list).☆92 · Updated last year
- A MoE implementation for PyTorch, [ATC'23] SmartMoE (a minimal gating sketch appears after this list)☆61 · Updated last year
- Welcome to the "LLM-travel" repository! Explore the inner workings of large language models (LLMs) 🚀. Dedicated to understanding, discussing, and implementing the techniques, principles, and applications of large models.☆296 · Updated 6 months ago
- Adds sequence parallelism to LLaMA-Factory☆153 · Updated last week
- ☆76 · Updated last year
- A collection of phenomena observed during the scaling of big foundation models, which may be developed into consensus, principles, or l…☆276 · Updated last year
- DeepSpeed Tutorial☆94 · Updated 6 months ago
- see readme☆92 · Updated 2 years ago
- Survey Paper List - Efficient LLM and Foundation Models☆238 · Updated 4 months ago
- Models and examples built with OneFlow☆96 · Updated 4 months ago
- An awesome GPU task scheduler. A lightweight, easy-to-use GPU cluster task scheduling tool. Star it if you find it useful.☆170 · Updated 2 years ago
- ☆33 · Updated last year
- ☆84 · Updated last year
- Mainly records multimodal knowledge for large language model (LLM) algorithm (application) engineers☆121 · Updated 9 months ago
- USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference☆421 · Updated this week
- Train a Chinese vocabulary with sentencepiece BPE and use it in transformers (a training sketch appears after this list).☆116 · Updated last year
- A brief of TorchScript by MNIST☆107 · Updated 2 years ago
- PyTorch training code covering single precision, half precision, mixed precision, single-GPU, multi-GPU (DP / DDP), FSDP, and DeepSpeed, comparing the training speed and GPU memory usage of each method (an AMP sketch appears after this list)☆87 · Updated 11 months ago
- A high-performance distributed deep learning system targeting large-scale and automated distributed training. If you have any interests, …☆107 · Updated last year
- PyTorch distributed training tutorials☆103 · Updated this week
- Chinese instruction tuning datasets☆125 · Updated 10 months ago
- Inference code for LLaMA models☆113 · Updated last year
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆58 · Updated 7 months ago
- Must-read Papers of Parameter-Efficient Tuning (Delta Tuning) Methods on Pre-trained Models.☆281 · Updated last year
- The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.☆64 · Updated last year
- ☆175 · Updated 3 months ago
- Train LLaMA on a single A100 80G node using 🤗 transformers and 🚀 DeepSpeed pipeline parallelism☆214 · Updated last year
- ☆152 · Updated this week
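
Several entries above revolve around DeepSpeed pipeline parallelism. As a rough illustration of what that training mode looks like, here is a minimal sketch using DeepSpeed's `PipelineModule`; the layer sizes, stage count, and `ds_config.json` path are placeholder assumptions, not taken from any repository above, and the script assumes it is launched with the `deepspeed` launcher.

```python
import torch
import torch.nn as nn
import deepspeed
from deepspeed.pipe import PipelineModule

# A toy stack of layers, partitioned across 2 pipeline stages.
# Sizes and stage count are arbitrary placeholders.
layers = [nn.Linear(1024, 1024) for _ in range(8)]
model = PipelineModule(layers=layers,
                       num_stages=2,
                       loss_fn=nn.CrossEntropyLoss())

# `ds_config.json` is a hypothetical DeepSpeed config file
# (batch sizes, optimizer, fp16 settings, etc.).
engine, _, _, _ = deepspeed.initialize(model=model,
                                       config="ds_config.json",
                                       model_parameters=model.parameters())

def batches():
    # Synthetic (input, label) pairs; a real run would use a DataLoader.
    while True:
        yield torch.randn(4, 1024), torch.randint(0, 1024, (4,))

# The pipeline engine pulls pairs from the iterator and schedules
# forward/backward micro-batches across the stages.
loss = engine.train_batch(data_iter=batches())
```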
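SmartMoE itself focuses on automated parallel training; purely to illustrate the layer structure such systems train, here is a minimal top-k gated mixture-of-experts layer in plain PyTorch. Every detail (sizes, expert MLP shape, the dense dispatch loop) is an illustrative assumption, and it deliberately ignores load balancing and expert parallelism.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MoE(nn.Module):
    """Minimal top-k gated mixture-of-experts over token vectors."""
    def __init__(self, d_model, num_experts=4, k=2):
        super().__init__()
        self.gate = nn.Linear(d_model, num_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                          nn.Linear(4 * d_model, d_model))
            for _ in range(num_experts))
        self.k = k

    def forward(self, x):                       # x: (tokens, d_model)
        scores = self.gate(x)                   # (tokens, num_experts)
        weights, idx = scores.topk(self.k, dim=-1)
        weights = F.softmax(weights, dim=-1)    # renormalize over the top-k
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            token_ids, slot = (idx == e).nonzero(as_tuple=True)
            if token_ids.numel():               # tokens routed to expert e
                out[token_ids] += weights[token_ids, slot, None] * expert(x[token_ids])
        return out

y = MoE(64)(torch.randn(10, 64))                # smoke test: 10 tokens, d_model=64
```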
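For the Chinese-vocabulary entry, the general recipe of training a BPE model with sentencepiece and loading it in transformers looks roughly like this; `corpus.txt`, the vocab size, and the choice of `LlamaTokenizer` as the wrapper class are assumptions for illustration.

```python
import sentencepiece as spm
from transformers import LlamaTokenizer

# Train a BPE vocabulary on a raw Chinese corpus
# (`corpus.txt` is a placeholder path, one sentence per line).
spm.SentencePieceTrainer.train(
    input="corpus.txt",
    model_prefix="zh_bpe",
    vocab_size=32000,
    model_type="bpe",
    character_coverage=0.9995,  # keep rare CJK characters
)

# Wrap the resulting .model file with a transformers tokenizer class
# that accepts a raw sentencepiece model.
tokenizer = LlamaTokenizer(vocab_file="zh_bpe.model")
print(tokenizer.tokenize("大语言模型"))
```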
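And for the precision-comparison entry, the mixed-precision leg of such a comparison typically uses PyTorch AMP. A minimal sketch, with the model, data, and hyperparameters as stand-ins:

```python
import torch
from torch import nn

model = nn.Linear(1024, 10).cuda()            # stand-in model
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
scaler = torch.cuda.amp.GradScaler()          # rescales the loss so fp16 grads don't underflow
loss_fn = nn.CrossEntropyLoss()

# Synthetic stand-in data; a real comparison would use a DataLoader.
data = [(torch.randn(32, 1024), torch.randint(0, 10, (32,))) for _ in range(10)]

for x, y in data:
    optimizer.zero_grad(set_to_none=True)
    with torch.cuda.amp.autocast():           # run eligible ops in half precision
        loss = loss_fn(model(x.cuda()), y.cuda())
    scaler.scale(loss).backward()             # backward on the scaled loss
    scaler.step(optimizer)                    # unscales grads, then steps
    scaler.update()
```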