JiangLiSJTU/token-ring

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JiangLiSJTU/token-ring)

JiangLiSJTU / token-ring

☆13

Alternatives and similar repositories for token-ring

Users that are interested in token-ring are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MXHX7199 / ICCV_2021_AFP
View on GitHub
AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao.
☆13Nov 8, 2021Updated 4 years ago
flexflow / flexflow-serve
View on GitHub
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
☆85Sep 15, 2025Updated 10 months ago
HydraQYH / hp_rms_norm
View on GitHub
High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)
☆30Jan 22, 2026Updated 5 months ago
TiledTensor / TiledBench
View on GitHub
Benchmark tests supporting the TiledCUDA library.
☆19Nov 19, 2024Updated last year
Doraemonzzz / xmixers
View on GitHub
Xmixers: A collection of SOTA efficient token/channel mixers
☆28Sep 4, 2025Updated 10 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
KuangjuX / cu-x
View on GitHub
🎉My Collections of CUDA Kernels~
☆11Jun 25, 2024Updated 2 years ago
Dao-AILab / gemm-cublas
View on GitHub
☆22May 5, 2025Updated last year
AlexwellChen / Toy_ML_Framework
View on GitHub
☆11May 16, 2026Updated 2 months ago
EdVince / whisper-trtllm
View on GitHub
Whisper in TensorRT-LLM
☆17Sep 21, 2023Updated 2 years ago
hazan-lab / flash-stu
View on GitHub
PyTorch implementation of the Flash Spectral Transform Unit.
☆22Sep 19, 2024Updated last year
hxdoit / lerobot
View on GitHub
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
☆33Feb 19, 2026Updated 4 months ago
mi150 / VaLoRA
View on GitHub
☆11May 19, 2025Updated last year
NVIDIA / free-threaded-python
View on GitHub
No-GIL Python environment featuring NVIDIA Deep Learning libraries.
☆71Apr 14, 2025Updated last year
foundation-model-stack / vllm-triton-backend
View on GitHub
A Triton-only attention backend for vLLM
☆27Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
yester31 / Cutlass_EX
View on GitHub
study of cutlass
☆22Nov 10, 2024Updated last year
antgroup / DeepXTrace
View on GitHub
DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments.
☆100Jan 16, 2026Updated 6 months ago
NonvolatileMemory / flash_tree_attn
View on GitHub
☆20Dec 24, 2024Updated last year
muriloboratto / NVSHEMEM
View on GitHub
Sample Codes using NVSHMEM on Multi-GPU
☆30Jan 22, 2023Updated 3 years ago
Psychic-DL / DiffTAD
View on GitHub
DiffTAD: Denoising Diffusion Probabilistic Models for Vehicle Trajectory Anomaly Detection
☆36Nov 28, 2023Updated 2 years ago
tile-ai / TileFoundry
View on GitHub
☆45Updated this week
WukLab / preble
View on GitHub
Stateful LLM Serving
☆105Mar 11, 2025Updated last year
DachengLi1 / AMP
View on GitHub
(NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.
☆44Nov 4, 2022Updated 3 years ago
madsys-dev / deepseekv2-profile
View on GitHub
☆156Mar 4, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
feifeibear / LLMRoofline
View on GitHub
Compare different hardware platforms via the Roofline Model for LLM inference tasks.
☆123Mar 13, 2024Updated 2 years ago
HPMLL / NVIDIA-Hopper-Benchmark
View on GitHub
☆113May 31, 2025Updated last year
tlc-pack / cutlass_fpA_intB_gemm
View on GitHub
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
☆96Jun 21, 2026Updated 3 weeks ago
KuangjuX / AttnLink
View on GitHub
An experimental communicating attention kernel based on DeepEP.
☆34Jul 29, 2025Updated 11 months ago
shanshw / LogConfigLocalizer
View on GitHub
☆17Apr 15, 2025Updated last year
MXHX7199 / SNN-SSTDP
View on GitHub
SSTDP is a efficient spiking neural network training framework, which is contributed by Fangxin Liu and Wenbo Zhao.
☆39Nov 8, 2021Updated 4 years ago
Infini-AI-Lab / vortex_torch
View on GitHub
Vortex: Programmable Sparse Attention for Agents as Algorithm Designers
☆67Jun 24, 2026Updated 3 weeks ago
AnilOsmanTur / video_anomaly_diffusion
View on GitHub
[ICIP 2023] Exploring Diffusion Models For Unsupervised Video Anomaly Detection
☆30Nov 8, 2023Updated 2 years ago
sauradip / DiffusionTAD
View on GitHub
[ICCV 2023] Official PyTorch implementation of the paper "DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion"
☆37Mar 30, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
infinigence / FlashOverlap
View on GitHub
A lightweight design for computation-communication overlap.
☆242Jan 20, 2026Updated 5 months ago
lukedodd / JitCalc
View on GitHub
Mathematical expression evaluator with just in time code generation.
☆12Apr 7, 2013Updated 13 years ago
kzkadc / regression-tta
View on GitHub
The official implementation of "Test-time Adaptation for Regression by Subspace Alignment" (ICLR 2025).
☆18Jun 6, 2025Updated last year
gty111 / gLLM
View on GitHub
An Efficient and Versatile Inference Engine for Distributed LLM Serving
☆66Jul 5, 2026Updated 2 weeks ago
Lou1sM / meaningful_image_complexity
View on GitHub
☆17Mar 24, 2025Updated last year
JieRen98 / SGEMM-SASS-Annotation
View on GitHub
☆21Mar 22, 2021Updated 5 years ago
davendw49 / gakg
View on GitHub
GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework by fusing papers' illustrations, text, and bibliometric data.
☆53Jul 11, 2024Updated 2 years ago