Summary of the Specs of Commonly Used GPUs for Training and Inference of LLMs
★77 · Aug 12, 2025 · Updated 7 months ago
Alternatives and similar repositories for GPUs-Specs
Users that are interested in GPUs-Specs are comparing it to the libraries listed below.
- Automatically update LLM inference systems papers daily using GitHub Actions (updates every 12 hours) ★12 · Mar 21, 2026 · Updated last week
- High-performance RMSNorm implementation using SM core storage (registers and shared memory) ★30 · Jan 22, 2026 · Updated 2 months ago
- Compare different hardware platforms via the Roofline Model for LLM inference tasks. ★119 · Mar 13, 2024 · Updated 2 years ago
- Repository for the MICCAI 2022 AutoPET challenge ★14 · Sep 19, 2022 · Updated 3 years ago
- Distributed Compiler based on Triton for Parallel Systems ★1,394 · Mar 11, 2026 · Updated 2 weeks ago
- ★16 · Jul 28, 2021 · Updated 4 years ago
- ★49 · Apr 15, 2024 · Updated last year
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa… ★33 · Apr 9, 2023 · Updated 2 years ago
- Artifact for "Marconi: Prefix Caching for the Era of Hybrid LLMs" [MLSys '25 Outstanding Paper Award, Honorable Mention] ★56 · Mar 5, 2025 · Updated last year
- Since the emergence of ChatGPT in 2022, the acceleration of Large Language Models has become increasingly important. Here is a list of pap… ★282 · Mar 6, 2025 · Updated last year
- A fast communication-overlapping library for tensor/expert parallelism on GPUs. ★1,273 · Aug 28, 2025 · Updated 6 months ago
- ★358 · Jan 28, 2026 · Updated last month
- Separate from hardware and used to learn some NCCL mechanisms ★25 · Apr 19, 2024 · Updated last year
- A variant of Ahash written in C++. ★10 · Mar 20, 2023 · Updated 3 years ago
- My learning notes about AI, including Machine Learning and Deep Learning. ★18 · Jun 30, 2019 · Updated 6 years ago
- Implementation of the Modbus protocol in .NET, covering ASCII, RTU, and TCP. ★10 · Jan 12, 2026 · Updated 2 months ago
- ★12 · May 23, 2018 · Updated 7 years ago
- Evaluation for 3D reconstruction, including monocular depth, video depth, relative camera pose, and multi-view point map estimation. ★20 · Aug 26, 2025 · Updated 7 months ago
- Materials for learning SGLang ★785 · Jan 5, 2026 · Updated 2 months ago
- Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024). ★25 · Feb 22, 2026 · Updated last month
- ★19 · Feb 28, 2022 · Updated 4 years ago
- PerFlow-AI is a programmable performance analysis, modeling, and prediction tool for AI systems. ★32 · Mar 12, 2026 · Updated 2 weeks ago
- ArchExplorer: Microarchitecture Exploration Via Bottleneck Analysis ★33 · Feb 20, 2024 · Updated 2 years ago
- Research about dataflow architecture ★12 · Nov 30, 2023 · Updated 2 years ago
- ★13 · Jan 23, 2021 · Updated 5 years ago
- Cycle-accurate C++ & SystemC simulator for the RISC-V GPGPU Ventus ★32 · Mar 4, 2026 · Updated 3 weeks ago
- Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing ★77 · Jan 8, 2025 · Updated last year
- An LLM inference engine, written in C++ ★19 · Feb 5, 2026 · Updated last month
- Tilus is a tile-level kernel programming language with explicit control over shared memory and registers. ★456 · Mar 17, 2026 · Updated last week
- A simple USB microphone with ADC oversampling using the STM32F407 MCU and MAX9814 microphone module ★10 · May 13, 2022 · Updated 3 years ago
- Asynchronous pipeline parallel optimization ★19 · Feb 2, 2026 · Updated last month
- Shadowsocks/ShadowsocksR account online monitoring ★12 · Nov 25, 2018 · Updated 7 years ago
- A benchmark suite for evaluating FaaS schedulers. ★23 · Nov 5, 2022 · Updated 3 years ago
- ★17 · Apr 16, 2024 · Updated last year
- ★19 · Aug 26, 2021 · Updated 4 years ago
- [HISTORICAL] A Lightweight (RISC-V) ISA Extension for AES and SM4 ★38 · Feb 4, 2021 · Updated 5 years ago
- A DAG processor and compiler for a tree-based spatial datapath. ★16 · Aug 24, 2022 · Updated 3 years ago
- Estimate MFU for DeepSeekV3 ★26 · Jan 5, 2025 · Updated last year
- NexRL is an ultra-loosely-coupled LLM post-training framework. ★104 · Mar 20, 2026 · Updated last week