wu-kan/GoPTX

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wu-kan/GoPTX)

wu-kan / GoPTX

GoPTX: Fine-grained GPU Kernel Fusion by PTX-level Instruction Flow Weaving

☆21

Alternatives and similar repositories for GoPTX

Users that are interested in GoPTX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Nelson-Cheung / yatsenos-riscv
View on GitHub
Rebuild YatSenOS On RISC-V 64.
☆23Jan 6, 2022Updated 4 years ago
MLSysU / EcoServe
View on GitHub
[OSDI' 26] Efficient LLM Serving on Commodity GPU Clusters with Data-Reduced Cross-Instance Orchestration
☆23Jul 5, 2026Updated 2 weeks ago
lukeluocn / multicoresysu2020
View on GitHub
☆11Aug 4, 2020Updated 5 years ago
sjtu-epcc / Tacker
View on GitHub
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
☆33Feb 10, 2025Updated last year
illinois-impact / klap
View on GitHub
A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches
☆15Jun 21, 2019Updated 7 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
wu-kan / HPL-AI
View on GitHub
An implementation of HPL-AI Mixed-Precision Benchmark based on hpl-2.3
☆30May 30, 2021Updated 5 years ago
SYSU-SCC / sysu-scc-spack-repo
View on GitHub
Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.
☆16Aug 20, 2025Updated 11 months ago
nono-Sang / light-rpc
View on GitHub
An Efficient RDMA-based RPC Framework
☆25Nov 14, 2023Updated 2 years ago
arcsysu / YatCC
View on GitHub
中山大学编译原理课程实验（完全重构版本）
☆149Jun 19, 2026Updated last month
arcsysu / SYsU-lang
View on GitHub
A mini, simple and modular compiler for SYsU/SysY(tiny C). Based on Clang/LLVM/ANTLR4/Bison/Flex.
☆221Nov 27, 2024Updated last year
MLSysU / TD-Pipe
View on GitHub
[ICPP'25] TD-Pipe: Temporally-Disaggregated Pipeline Parallelism Architecture for High-Throughput LLM Inference
☆52Dec 24, 2025Updated 6 months ago
YatSenOS / YatSenOS-Tutorial-Volume-2
View on GitHub
A Rust x86_64 OS lab tutorial.
☆72Jun 23, 2026Updated 3 weeks ago
howardlau1999 / autograder-server
View on GitHub
☆24Mar 21, 2024Updated 2 years ago
SubjectNoi / RTANN
View on GitHub
☆17Aug 2, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
aoli-al / HFuse
View on GitHub
Horizontal Fusion
☆24Jan 7, 2022Updated 4 years ago
zejia-lin / BulletServe
View on GitHub
Boosting GPU utilization for LLM serving via dynamic spatial-temporal prefill & decode orchestration
☆53Jan 8, 2026Updated 6 months ago
AlibabaResearch / recom
View on GitHub
An Optimizing Compiler for Recommendation Model Inference
☆26Jun 5, 2025Updated last year
SYSU-SCC / yatcpu-docs
View on GitHub
Documentation for YatCPU
☆55Nov 15, 2023Updated 2 years ago
SJTU-IPADS / PipeLLM
View on GitHub
☆28Dec 22, 2024Updated last year
baco-authors / baco
View on GitHub
☆17Dec 8, 2023Updated 2 years ago
owensgroup / ATOS
View on GitHub
Multi-GPU dynamic scheduler using PGAS style cross-GPU communication
☆29Jul 23, 2023Updated 2 years ago
ivechan / mentohust-SYSU
View on GitHub
mentohust的SYSU版本
☆19May 7, 2016Updated 10 years ago
GZTimeWalker / YYDB
View on GitHub
Yat another MySQL storage engine, a database course project.
☆13Dec 23, 2022Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
nadavrot / pgo_ml
View on GitHub
Source code for the paper "Profile Guided Optimization without Profiles: A Machine Learning Approach"
☆26Dec 30, 2021Updated 4 years ago
ray-project / contrib-workflow-dag
View on GitHub
☆11May 4, 2022Updated 4 years ago
chhzh123 / Krill
View on GitHub
An efficient concurrent graph processing system
☆46Oct 27, 2021Updated 4 years ago
tanzelin430 / libsmctrl
View on GitHub
libsmctrl论文的复现，添加了python端接口，可以在python端灵活调用接口来分配计算资源
☆12May 21, 2024Updated 2 years ago
SoilRos / llvm-xray-tools
View on GitHub
Tools for analysing results produced by the llvm-xray instrumentation
☆16Jan 15, 2021Updated 5 years ago
DebashisGanguly / gpgpu-sim_UVMSmart
View on GitHub
☆83Nov 16, 2020Updated 5 years ago
Deep-Learning-Profiling-Tools / fasten
View on GitHub
☆14Apr 24, 2024Updated 2 years ago
haswelliris / CPC2018-GROMACS
View on GitHub
CPC2018第二届国产CPU并行应用挑战赛决赛
☆11Oct 26, 2018Updated 7 years ago
aprylewu / aitimeline
View on GitHub
Conference timeline browser built with Next.js
☆16Apr 5, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
OSU-STARLAB / UVM_benchmark
View on GitHub
☆34Sep 9, 2020Updated 5 years ago
csl-uth / KinectFusion-fpga
View on GitHub
☆11Jan 21, 2021Updated 5 years ago
HPMLL / SpInfer_EuroSys25
View on GitHub
☆35Apr 2, 2025Updated last year
Jeevananthan-23 / ziglang-caches
View on GitHub
In-memory cache implementation with commonly used LRU, W-LFU and S3-FIFO as the eviction policy
☆15Apr 24, 2024Updated 2 years ago
yaobaiwei / PPA-Assembler
View on GitHub
A toolkit for de novo genome assembly based on Pregel.
☆12Dec 16, 2019Updated 6 years ago
midwinter1993 / dogfood
View on GitHub
Artifact evaluation for Dogfood
☆12Feb 22, 2020Updated 6 years ago
MoZeWei / moTuner
View on GitHub
☆10May 12, 2022Updated 4 years ago