☆23Feb 16, 2022Updated 4 years ago
Alternatives and similar repositories for CUDACommunityMeetup2021
Users that are interested in CUDACommunityMeetup2021 are comparing it to the libraries listed below
Sorting:
- A intelligent matrix format designer for SpMV☆10Oct 10, 2023Updated 2 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Jun 14, 2023Updated 2 years ago
- ☆12Dec 21, 2023Updated 2 years ago
- Utilities for accessing AMD's Machine-Readable GPU ISA Specifications.☆46Sep 24, 2025Updated 5 months ago
- ☆12Jan 19, 2020Updated 6 years ago
- ❤️ CUDA/C++ GPU graph analytics simplified.☆32Sep 19, 2022Updated 3 years ago
- MD5 core in verilog☆13May 1, 2012Updated 13 years ago
- Personal notes about Fortran programming language☆13Jun 3, 2021Updated 4 years ago
- Generic exascale-ready library for halo-exchange operations on variety of grids/meshes☆10Updated this week
- PaStiX (Parallel Sparse matriX package) solver library☆20Nov 20, 2018Updated 7 years ago
- Advanced Parallel Programming☆21Mar 16, 2021Updated 5 years ago
- End to End steps for adding custom ops in PyTorch.☆24Aug 20, 2020Updated 5 years ago
- CUDA executors☆14Dec 4, 2020Updated 5 years ago
- ☆22Updated this week
- An extension library of WMMA API (Tensor Core API)☆111Jul 12, 2024Updated last year
- SMASH is a hardware-software cooperative mechanism that enables highly-efficient indexing and storage of sparse matrices. The key idea of…☆18May 17, 2020Updated 5 years ago
- A CUDA implementation of the Tsetlin Machine based on bitwise operators☆26Aug 19, 2019Updated 6 years ago
- ☆28Sep 28, 2022Updated 3 years ago
- ☆11Jun 9, 2023Updated 2 years ago
- 一个用Apple Metal实现的Llama和通义千问大模型本地推理☆10Apr 26, 2024Updated last year
- Global Address SPace toolbox -- Julia wrapper☆10Nov 17, 2017Updated 8 years ago
- The implementation of Paper “Multiparameter modeling with ANN for antenna design ”.☆14Apr 13, 2020Updated 5 years ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆66Sep 9, 2025Updated 6 months ago
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line☆24Updated this week
- 自动识别文本中的关键词并加粗处理。☆10Oct 30, 2024Updated last year
- A Lightweight Graph Processing Framework for Multi-GPUs☆14Apr 15, 2015Updated 10 years ago
- An LLM-powered chatbot for fediverse. A tech demo for BotKit.☆14Dec 20, 2025Updated 3 months ago
- Range-based for loops to iterate over a range of numbers or values☆34Nov 23, 2016Updated 9 years ago
- Website for CS 265☆33Dec 27, 2024Updated last year
- GPU model checker☆13Apr 17, 2019Updated 6 years ago
- An FPGA design for simulating biological neurons☆17Jul 5, 2024Updated last year
- Asynchronous Multi-GPU Programming Framework☆48Jun 8, 2021Updated 4 years ago
- ☆14Oct 20, 2021Updated 4 years ago
- Caffe: a fast open framework for deep learning.☆14Aug 26, 2015Updated 10 years ago
- A cross-platform dotfiles manager☆14Jan 19, 2026Updated 2 months ago
- Tutorial of OpenGL ES using PowerVR framework☆12Jan 4, 2023Updated 3 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- The Singularity Community Catalog of Singularity* recipe files.☆11Oct 27, 2025Updated 4 months ago
- ☆14Oct 5, 2023Updated 2 years ago