sunkx109 / GPUs-Specs
Summary of the Specs of Commonly Used GPUs for Training and Inference of LLM
☆75 · Aug 12, 2025 · Updated 6 months ago
Alternatives and similar repositories for GPUs-Specs
Users that are interested in GPUs-Specs are comparing it to the libraries listed below
- Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours) ☆12 · Feb 8, 2026 · Updated last week
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory) ☆26 · Jan 22, 2026 · Updated 3 weeks ago
- repository for the MICCAI 2022 AutoPET challenge ☆14 · Sep 19, 2022 · Updated 3 years ago
- ☆16 · Jul 28, 2021 · Updated 4 years ago
- ☆13 · Jan 23, 2021 · Updated 5 years ago
- ☆49 · Apr 15, 2024 · Updated last year
- Compare different hardware platforms via the Roofline Model for LLM inference tasks. ☆120 · Mar 13, 2024 · Updated last year
- ☆19 · Aug 26, 2021 · Updated 4 years ago
- PerFlow-AI is a programmable performance analysis, modeling, prediction tool for AI system. ☆28 · Feb 3, 2026 · Updated last week
- Separate from hardware and used to learn some NCCL mechanisms ☆25 · Apr 19, 2024 · Updated last year
- Distributed Compiler based on Triton for Parallel Systems ☆1,350 · Updated this week
- Getting Starting with NIMBUS-CORE ☆10 · Dec 16, 2023 · Updated 2 years ago
- Official Repo of CudaForge ☆59 · Dec 2, 2025 · Updated 2 months ago
- Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap… ☆283 · Mar 6, 2025 · Updated 11 months ago
- ☆19 · Jul 1, 2020 · Updated 5 years ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph ☆56 · Jul 3, 2022 · Updated 3 years ago
- My learning notes about AI, including Machine Learning and Deep Learning. ☆18 · Jun 30, 2019 · Updated 6 years ago
- A benchmark suite for evaluating FaaS scheduler. ☆23 · Nov 5, 2022 · Updated 3 years ago
- Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing ☆68 · Jan 8, 2025 · Updated last year
- Dynamic resources changes for multi-dimensional parallelism training ☆30 · Aug 22, 2025 · Updated 5 months ago
- Materials for learning SGLang ☆743 · Jan 5, 2026 · Updated last month
- Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024). ☆25 · Jul 15, 2025 · Updated 6 months ago
- A fast communication-overlapping library for tensor/expert parallelism on GPUs. ☆1,247 · Aug 28, 2025 · Updated 5 months ago
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa… ☆33 · Apr 9, 2023 · Updated 2 years ago
- Asynchronous pipeline parallel optimization ☆19 · Feb 2, 2026 · Updated last week
- ☆343 · Jan 28, 2026 · Updated 2 weeks ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co… ☆13 · Jan 16, 2026 · Updated 3 weeks ago
- Cycle-accurate C++ & SystemC simulator for the RISC-V GPGPU Ventus ☆31 · Dec 24, 2025 · Updated last month
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning ☆65 · Oct 31, 2025 · Updated 3 months ago
- ArchExplorer: Microarchitecture Exploration Via Bottleneck Analysis ☆33 · Feb 20, 2024 · Updated last year
- MrlX: A Multi-Agent Reinforcement Learning Framework ☆190 · Jan 19, 2026 · Updated 3 weeks ago
- 📰 Must-read papers on KV Cache Compression (constantly updating). ☆658 · Sep 30, 2025 · Updated 4 months ago
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and … ☆36 · Aug 29, 2025 · Updated 5 months ago
- ACM-ICPC Template ☆29 · Nov 17, 2025 · Updated 2 months ago
- NexRL is an ultra-loosely-coupled LLM post-training framework. ☆97 · Feb 4, 2026 · Updated last week
- Python tools for meshing rivers ☆12 · Oct 2, 2025 · Updated 4 months ago
- Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline mod… ☆617 · Sep 11, 2024 · Updated last year
- A library to analyze PyTorch traces. ☆464 · Feb 4, 2026 · Updated last week
- Latency and Memory Analysis of Transformer Models for Training and Inference ☆478 · Apr 19, 2025 · Updated 9 months ago