sujunyan / tex-galleryLinks
☆14Updated 2 years ago
Alternatives and similar repositories for tex-gallery
Users that are interested in tex-gallery are comparing it to the libraries listed below
Sorting:
- [EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs☆79Updated last year
- ☆29Updated 2 years ago
- PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity☆121Updated last month
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Updated 2 years ago
- A telegram bot that sends you a message when the GPU is in use☆10Updated last year
- 北京大学本科生毕业论文 latex 模版,基于 pkuthss 1.9.0 修改☆27Updated 3 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆31Updated 5 years ago
- ☆49Updated 4 years ago
- ASPLOS'24: Optimal Kernel Orchestration for Tensor Programs with Korch☆39Updated 10 months ago
- Implementation of parallel Breadth First Algorithm for graph traversal using CUDA and C++ language.☆34Updated 6 years ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆25Updated 2 years ago
- A collection of min-cut/max-flow algorithms.☆50Updated 3 years ago
- ☆36Updated 2 years ago
- Beamer template with CUHK colors and logos☆39Updated 4 years ago
- My paper/code reading notes in Chinese☆46Updated 8 months ago
- pytorch cuda extension of grid_sample1d☆49Updated 4 years ago
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆49Updated 7 months ago
- tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)☆29Updated 2 years ago
- 兆京大学班车预约 for Humans™☆30Updated 2 months ago
- ☆112Updated 4 years ago
- [MobiCom 24] Efficient and Adaptive DNN inference under changeable memory budgets☆58Updated last year
- Utilities for paper writing.☆12Updated last month
- the CPU implementation of bucket based farthest point sampling, achieves 7-81x speedup than the conventional implementation☆26Updated 2 years ago
- A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention☆278Updated 2 months ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆58Updated last year
- SJTU thesis template version 2021, modified from the official version☆46Updated 4 years ago
- This is the unofficial LaTeX class for Master/Ph.D. Thesis Template of Huazhong University of Science and Technology☆32Updated 2 years ago
- a size profiler for cuda binary☆72Updated 3 weeks ago
- Gaussian Splating 2d implemented in triton☆11Updated last year
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆95Updated 2 years ago