本项目提供了基于910B的huggingface LLM模型的Tensor Parallel(TP)部署教程,同时也可以作为一份极简的TP学习代码。
☆31Jan 6, 2026Updated 2 months ago
Alternatives and similar repositories for LLM-TP-Inference-on-910B
Users that are interested in LLM-TP-Inference-on-910B are comparing it to the libraries listed below
Sorting:
- Due to the huge vocaburary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this projec…☆33Jan 6, 2026Updated 2 months ago
- [NeurIPS 2020] "Once-for-All Adversarial Training: In-Situ Tradeoff between Robustness and Accuracy for Free" by Haotao Wang*, Tianlong C…☆44Dec 30, 2021Updated 4 years ago
- [ECCV2022] Optimization over Disentangled Encoding: Unsupervised Cross-Domain Point Cloud Completion via Occlusion Factor Manipulation☆30Nov 21, 2022Updated 3 years ago
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆40May 26, 2025Updated 9 months ago
- This is llvm-nmx backend compiler☆12Aug 22, 2023Updated 2 years ago
- ☆11Feb 19, 2022Updated 4 years ago
- [AAAI 2022 Oral] This is a Pytorch implementation of the AAAI 2022 paper "Cross-Domain Empirical Risk Minimization for Unbiased Long-tail…☆33Feb 17, 2022Updated 4 years ago
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning☆36Aug 26, 2022Updated 3 years ago
- ☆13Apr 14, 2025Updated 10 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆19Jul 10, 2025Updated 7 months ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆12Nov 1, 2025Updated 4 months ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- ☆13Nov 5, 2024Updated last year
- ☆20Mar 10, 2025Updated 11 months ago
- Cheatsheet for slurm command lines☆10Apr 9, 2023Updated 2 years ago
- A CSS3 Overlay system for modal dialogs.☆66Dec 16, 2010Updated 15 years ago
- CLIPCleaner: Cleaning Noisy Labels with CLIP (ACM MM2024)☆13Apr 28, 2025Updated 10 months ago
- A Formalization of TeX in Coq☆11Feb 27, 2022Updated 4 years ago
- This kernel adds supports for running Docker on Sony Xperia 5 II (pdx206).☆10Mar 14, 2023Updated 2 years ago
- ☆11Oct 17, 2024Updated last year
- ☆33Jan 9, 2026Updated last month
- The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Ma…☆13Sep 13, 2024Updated last year
- 3rd party dependencies for DALI project☆11Updated this week
- ☆12Jul 24, 2025Updated 7 months ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Aug 6, 2025Updated 6 months ago
- Shared Cheat Sheet for Coq☆10Sep 8, 2016Updated 9 years ago
- The PyTorch implementation of DSM (EMNLP 2022).☆10Mar 26, 2024Updated last year
- 我的一些开源文档☆10Feb 18, 2025Updated last year
- Cross-GCN: Enhancing Graph Convolutional Network with k-Order Feature Interactions☆12Mar 26, 2020Updated 5 years ago
- Code for the paper "Multi-Task Learning of Object States and State-Modifying Actions from Web Videos" published in TPAMI☆11Mar 3, 2024Updated 2 years ago
- Course Info for VIP-GEAI☆11Apr 11, 2024Updated last year
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions☆22Feb 11, 2026Updated 3 weeks ago
- bev_lane_det with lower resolution☆10Sep 1, 2023Updated 2 years ago
- Causal Reasoning for Membership Inference Attacks☆11Oct 21, 2022Updated 3 years ago
- HSViT: Horizontally Scalable Vision Transformer☆13Nov 6, 2024Updated last year
- This repository collects awesome representative papers and resources for "From Pre-training to Post-training: A Survey on Time Series Fou…☆31Feb 1, 2026Updated last month
- This repository contains the dataset of the paper ARGUS: Context-Based Detection of Stealthy IoT Infiltration Attacks☆12Apr 28, 2023Updated 2 years ago
- Tool to convert JSON formatted discussion posts on Canvas LMS into HTML files - similar to saving student text-entry assignments☆13May 20, 2022Updated 3 years ago
- 更纯粹、更高压缩率的Tokenizer in Rust☆13Dec 21, 2024Updated last year