☆17Jul 1, 2020Updated 5 years ago
Alternatives and similar repositories for gpu_sgemm
Users who are interested in gpu_sgemm are comparing it to the libraries listed below.
- The SEAL-CPU backend is a Reference backend engine for HEBench which is a shared library that implements the required functions specified…☆11Mar 3, 2023Updated 3 years ago
- ☆40Apr 3, 2022Updated 3 years ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA☆35Jul 28, 2020Updated 5 years ago
- ☆40Feb 28, 2020Updated 6 years ago
- Driver to measure vmlaunch latency☆10Jun 28, 2022Updated 3 years ago
- Implementation of the TFHE homomorphic encryption scheme.☆12May 14, 2021Updated 4 years ago
- ☆48Dec 11, 2020Updated 5 years ago
- A project that patches the Xiaomi Linux system so that it can connect to ChatGPT via WebRTC and WebSocket☆10Aug 29, 2025Updated 6 months ago
- Apache Solr: Because your Database is not a Search Engine☆12Feb 27, 2019Updated 7 years ago
- Source code comments☆11Feb 13, 2026Updated 2 weeks ago
- This image builds a T-Rex CUDA miner.☆10Dec 4, 2022Updated 3 years ago
- GPU implementation of Winograd convolution☆10Oct 23, 2017Updated 8 years ago
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading☆13Jun 28, 2025Updated 8 months ago
- ☆11Jan 5, 2022Updated 4 years ago
- Memory management simulator, using Hashed Page Table. Page Replacement Algorithms: Least Recently Used (LRU) and Second Chance.☆10Apr 12, 2021Updated 4 years ago
- ☆11Mar 4, 2021Updated 4 years ago
- Explore the behavior of the SystemC kernel event-driven simulator (aka "the engine")☆12Jan 17, 2024Updated 2 years ago
- Verilog RTL Implementation of DNN☆10Jun 26, 2018Updated 7 years ago
- Automatic ReLU Reduction☆15Dec 20, 2023Updated 2 years ago
- A High-Performance Side-Channel-Resistant AES on GPUs☆13May 9, 2019Updated 6 years ago
- Repo for PyChart 1.39, refs http://download.gna.org/pychart/☆10Sep 29, 2014Updated 11 years ago
- Secure Inference Resilient Against Malicious Clients☆15May 3, 2022Updated 3 years ago
- A sparse BLAS lib supporting multiple backends☆50Nov 23, 2025Updated 3 months ago
- A Python tool to measure the energy consumption of software☆14Feb 5, 2026Updated 3 weeks ago
- ☆13Mar 8, 2023Updated 2 years ago
- ☆10Feb 11, 2023Updated 3 years ago
- Ripple: Accelerating Programmable Bootstraps for FHE with Wavelet Approximations☆12Aug 8, 2024Updated last year
- Tool kit to accelerate exploratory data analysis and data cleaning☆11Mar 22, 2021Updated 4 years ago
- A repository for code used in the paper "On the precision loss in approximate homomorphic encryption"☆10Jan 16, 2025Updated last year
- Multiple 1-stencil implementations using NVIDIA CUDA.☆13Dec 2, 2017Updated 8 years ago
- cURL + Python Weibo Wrapper.☆10Dec 8, 2017Updated 8 years ago
- LLVM Plugin to Instrument Global Memory Accesses in CUDA Kernels☆10Jun 8, 2020Updated 5 years ago
- Generate a Visual Studio solution from a Bazel workspace.☆13Jan 19, 2022Updated 4 years ago
- Repo to hold HammerBlade PyTorch port. Based on PyTorch v1.4.0☆14Oct 4, 2022Updated 3 years ago
- GVProf: A Value Profiler for GPU-based Clusters☆53Mar 24, 2024Updated last year
- We use LSTM (Long Short-Term Memory) and BERT based models to carry out modeling and visualisation of the tweets based on Covid-19 in the…☆10Sep 15, 2021Updated 4 years ago
- Finding the most similar tone/color in a large collection of audio.☆13Jun 17, 2024Updated last year
- MySQLdb is a Python DB API-2.0 compliant library to interact with MySQL 3.23-5.1 (unofficial mirror)☆15Feb 17, 2019Updated 7 years ago
- ☆14Jul 23, 2025Updated 7 months ago