☆15Apr 28, 2023Updated 2 years ago
Alternatives and similar repositories for GPU-Roofline-Python
Users that are interested in GPU-Roofline-Python are comparing it to the libraries listed below
Sorting:
- A Winograd based kernel for convolutions in deep learning framework☆15Jul 22, 2017Updated 8 years ago
- [ICCV 2023] Code for "Minimal Solutions to Generalized Three-View Relative Pose Problem" (oral presentation)☆14Jul 8, 2025Updated 8 months ago
- This repository implements the fast N-view triangulation solver☆26Jul 4, 2024Updated last year
- Code for ICML2023 Paper: Continuation Path Learning for Homotopy Optimization☆13Dec 31, 2025Updated 2 months ago
- An official MATLAB implementation of the paper "A Simple Direct Solution to the Perspective-Three-Point Problem", BMVC2019.☆14Oct 1, 2021Updated 4 years ago
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆93Mar 4, 2026Updated 2 weeks ago
- Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support…☆50Aug 21, 2018Updated 7 years ago
- A Triton-only attention backend for vLLM☆24Feb 11, 2026Updated last month
- Official repo for BWLer: Barycentric Weight Layer☆29Sep 26, 2025Updated 5 months ago
- ☆10Apr 24, 2023Updated 2 years ago
- GEMV implementation with CUTLASS☆19Aug 21, 2025Updated 7 months ago
- GPU implementation of Winograd convolution☆10Oct 23, 2017Updated 8 years ago
- ☆50Jun 27, 2019Updated 6 years ago
- Matlab demo for our CVPR'19 publication: Mapping, Localization and Path Planning for Image-based Navigation using Visual Features and Map☆24Dec 17, 2019Updated 6 years ago
- Fast GPU based tensor core reductions☆13Jan 13, 2023Updated 3 years ago
- ☆13Oct 9, 2023Updated 2 years ago
- A high-performance toolkit for quantum and classical chemistry calculations.☆40Mar 13, 2026Updated last week
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…☆12Aug 12, 2022Updated 3 years ago
- This new benchmark dataset, Open-Structure, is proposed to evaluate visual odometry and SLAM methods, which directly equips point and lin…☆71Jun 23, 2025Updated 8 months ago
- 对yolov4进行通道剪枝☆15Jun 20, 2022Updated 3 years ago
- NEAT: Distilling 3D Wireframes from Neural Attraction Fields (CVPR 2024)☆73Mar 29, 2024Updated last year
- New batched algorithm for sparse matrix-matrix multiplication (SpMM)☆16May 7, 2019Updated 6 years ago
- A intelligent matrix format designer for SpMV☆10Oct 10, 2023Updated 2 years ago
- PKU Mirror Frontend☆11Apr 5, 2025Updated 11 months ago
- A Lightweight Graph Processing Framework for Multi-GPUs☆14Apr 15, 2015Updated 10 years ago
- ☆111Apr 19, 2024Updated last year
- GPU model checker☆13Apr 17, 2019Updated 6 years ago
- ☆13Jun 23, 2022Updated 3 years ago
- Convolutional Neural Network of vgg19 model using Cuda to accelerate☆12Jun 11, 2018Updated 7 years ago
- ☆17Jul 5, 2024Updated last year
- A comprehensive repository for Compute Express Link (CXL) resources: covering research papers, specifications, simulation/emulation tools…☆23Feb 24, 2026Updated 3 weeks ago
- ☆19Aug 26, 2021Updated 4 years ago
- UNIST blackboard web extension program☆12Apr 20, 2023Updated 2 years ago
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs☆16Feb 28, 2019Updated 7 years ago
- ☆11Sep 16, 2024Updated last year
- ☆14Apr 24, 2024Updated last year
- ☆19Dec 3, 2019Updated 6 years ago
- PHP 版本 野生工大助手☆18Jan 18, 2020Updated 6 years ago
- Phys4DGen: A Physics-Driven Framework for Controllable and Efficient 4D Content Generation from a Single Image☆12May 10, 2025Updated 10 months ago