iiis-turing-llm / llm-training-calculator

☆49

Alternatives and similar repositories for llm-training-calculator

Users that are interested in llm-training-calculator are comparing it to the libraries listed below

alibaba / ChatLearn
A flexible and efficient training framework for large-scale alignment tasks
☆388 Updated this week
pp1230 / LLMGPUMemEstimator
The GPU RAM Estimator provides a simple tool for estimating GPU memory usage during training and inference.
☆34 Updated last year
FlagOpen / FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
☆325 Updated this week
mindspore-lab / mindformers
☆169 Updated this week
genggui001 / Megatron-DeepSpeed-Llama
☆83 Updated last year
CASIA-LM / ChineseWebText
☆172 Updated last year
Glanvery / LLM-Travel
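Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to deeply understanding, discussing, and implementing the techniques, principles, and applications related to large models.
☆329 Updated 11 months ago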
InternLM / InternEvo
InternEvo is an open-source lightweight training framework that aims to support model pre-training without the need for extensive dependencie…
☆393 Updated 2 weeks ago
alipay / PainlessInferenceAcceleration
Accelerate inference without tears
☆319 Updated 4 months ago
hengjiUSTC / learn-llm
☆111 Updated 8 months ago
ninehills / llm-inference-benchmark
LLM Inference benchmark
☆422 Updated 11 months ago
OpenBMB / UltraEval
[ACL 2024 Demo] Official GitHub repo for UltraEval: An open source framework for evaluating foundation models.
☆244 Updated 8 months ago
Tencent / KsanaLLM
☆463 Updated this week
pengr / LLM-Synthetic-Data
A live reading list for LLM-synthetic-data.
☆308 Updated last week
MoFHeka / LLaMA-Megatron
A LLaMA1/LLaMA2 Megatron implementation.
☆28 Updated last year
sunkx109 / llama
Inference code for LLaMA models
☆122 Updated last year
yanqiangmiffy / how-to-train-tokenizer
How to train an LLM tokenizer
☆151 Updated 2 years ago
CoinCheung / gdGPT
Train LLMs (bloom, llama, baichuan2-7b, chatglm3-6b) with DeepSpeed pipeline mode. Faster than ZeRO/ZeRO++/FSDP.
☆97 Updated last year
OpenBMB / BMTrain
Efficient Training (including pre-training and fine-tuning) for Big Models
☆600 Updated last month
HarderThenHarder / RLLoggingBoard
A visualization tool for deeper understanding and easier debugging of RLHF training.
☆228 Updated 4 months ago
modelscope / dash-infer
DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …
☆259 Updated last month
OpenMOSS / CoLLiE
Collaborative Training of Large Language Models in an Efficient Way
☆416 Updated 10 months ago
THUDM / LongBench
LongBench v2 and LongBench (ACL '25 & '24)
☆926 Updated 6 months ago
THUDM / slime
slime is an LLM post-training framework aiming for RL scaling.
☆596 Updated this week
THUDM / AlignBench
A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)
☆398 Updated 11 months ago
mindspore-lab / mindrlhf
☆36 Updated 6 months ago
LLaMafia / llamafia.github
☆319 Updated last year
OpenBMB / InfiniteBench
Code for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
☆341 Updated 9 months ago
LLMServe / DistServe
Disaggregated serving system for Large Language Models (LLMs).
☆642 Updated 3 months ago
a-m-team / a-m-models
a-m-team's exploration in large language modeling
☆173 Updated last month