KaihuaTang / LLM-TP-Inference-on-910BLinks

本项目提供了基于910B的huggingface LLM模型的Tensor Parallel(TP)部署教程，同时也可以作为一份极简的TP学习代码。

☆27

Alternatives and similar repositories for LLM-TP-Inference-on-910B

Users that are interested in LLM-TP-Inference-on-910B are comparing it to the libraries listed below

Sorting:

xmu-xiaoma666 / Multimodal-Open-O1
Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…
☆29Updated 10 months ago
maple-research-lab / SLOT
☆101Updated last month
yujunhuics / Reyes
从零到一实现了一个多模态大模型，并命名为Reyes（睿视），R：睿，eyes：眼。Reyes的参数量为8B，视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的是Qwen2.5-7B-Instruct，Reyes也通过一个两层MLP投影层连…
☆22Updated 5 months ago
justchenhao / ChatDailyPapers
Build a daily academic subscription pipeline! Get daily Arxiv papers and corresponding chatGPT summaries with pre-defined keywords. It is…
☆41Updated 2 years ago
GAIR-NLP / MAYE
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme
☆138Updated 3 months ago
ding523 / Curr_REFT
☆67Updated 2 months ago
bobo0810 / LearnDeepSpeed
DeepSpeed教程 & 示例注释 & 学习笔记（大模型高效训练）
☆173Updated last year
liangyuwang / zo2
ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory
☆166Updated 3 weeks ago
xxcheng0708 / pytorch-model-train-template
pytorch单精度、半精度、混合精度、单卡、多卡（DP / DDP）、FSDP、DeepSpeed模型训练代码，并对比不同方法的训练速度以及GPU内存的使用
☆114Updated last year
JerryYin777 / Cross-Layer-Attention
Self Reproduction Code of Paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention (MIT CSAIL)
☆17Updated last year
wutaiqiang / MoSLoRA
☆111Updated last year
OpenGVLab / V2PE
[ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding
☆54Updated 7 months ago
Chen-GX / C-3PO
[ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…
☆37Updated 3 months ago
Haochen-Wang409 / TreeVGR
Official implementation of "Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology"
☆49Updated 3 weeks ago
testtimescaling / testtimescaling.github.io
"what, how, where, and how well? a survey on test-time scaling in large language models" repository
☆56Updated this week
kxfan2002 / SophiaVL-R1
SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward
☆72Updated last month
waltonfuture / RL-with-Cold-Start
SFT+RL boosts multimodal reasoning
☆22Updated last month
lzhxmu / CPPO
CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models
☆145Updated 2 months ago
FateScript / token_visualizer
Token level visualization tools for large language models
☆83Updated 6 months ago
Kamichanw / CoS
[ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"
☆23Updated last month
RifleZhang / LLaVA-Reasoner-DPO
☆85Updated 6 months ago
AMAP-ML / GPG
GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning
☆152Updated 2 months ago
OpenRLHF / OpenRLHF-M
An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.
☆138Updated 3 months ago
RUCAIBox / Virgo
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆105Updated 2 months ago
leo-yangli / VB-LoRA
This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).
☆39Updated 9 months ago
OpenDFM / MULTI-Benchmark
MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images
☆40Updated last month
THUDM / Awesome-Parameter-Efficient-Fine-Tuning-for-Foundation-Models
Parameter-Efficient Fine-Tuning for Foundation Models
☆79Updated 4 months ago
SkyworkAI / Skywork-Reward-V2
Scaling Preference Data Curation via Human-AI Synergy
☆95Updated last month
pkunlp-icler / PCA-EVAL
[ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
☆105Updated last year
kesenzhao / UV-CoT
☆25Updated last week