☆23Feb 16, 2022Updated 4 years ago
Alternatives and similar repositories for CUDACommunityMeetup2021
Users that are interested in CUDACommunityMeetup2021 are comparing it to the libraries listed below.
- Statistics on GPUs☆33Apr 18, 2026Updated last week
- RAPIDS Deployment Documentation☆15Apr 17, 2026Updated 2 weeks ago
- ☆17Feb 26, 2020Updated 6 years ago
- An intelligent matrix format designer for SpMV☆10Oct 10, 2023Updated 2 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Jun 14, 2023Updated 2 years ago
- ☆15Feb 13, 2018Updated 8 years ago
- Comparing Deep Learning Inference of PyTorch models running on CPU, CUDA and TensorRT☆16Feb 20, 2022Updated 4 years ago
- ☆12Jan 19, 2020Updated 6 years ago
- ❤️ CUDA/C++ GPU graph analytics simplified.☆32Sep 19, 2022Updated 3 years ago
- MD5 core in Verilog☆13May 1, 2012Updated 14 years ago
- Generic exascale-ready library for halo-exchange operations on a variety of grids/meshes☆10Mar 28, 2026Updated last month
- Read custom dataset☆12Mar 31, 2023Updated 3 years ago
- Advanced Parallel Programming☆21Mar 16, 2021Updated 5 years ago
- End to End steps for adding custom ops in PyTorch.☆24Aug 20, 2020Updated 5 years ago
- PaStiX (Parallel Sparse matriX package) solver library☆20Nov 20, 2018Updated 7 years ago
- ☆14Apr 10, 2023Updated 3 years ago
- An extension library of WMMA API (Tensor Core API)☆111Jul 12, 2024Updated last year
- SMASH is a hardware-software cooperative mechanism that enables highly-efficient indexing and storage of sparse matrices. The key idea of…☆18May 17, 2020Updated 5 years ago
- A CUDA implementation of the Tsetlin Machine based on bitwise operators☆26Aug 19, 2019Updated 6 years ago
- Finetuning BLOOM on a single GPU using gradient-accumulation☆32Mar 29, 2023Updated 3 years ago
- ☆28Sep 28, 2022Updated 3 years ago
- Parallel_Computer_Architecture classic books☆17May 13, 2022Updated 3 years ago
- CUDA matrix computation library specialized for small matrix operations (3x3, 4x4, 1x3, 1x4, etc.). Including buffer☆19Mar 8, 2024Updated 2 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆97Aug 14, 2023Updated 2 years ago
- Implementation from scratch in CUDA C++ of image processing algorithms.☆22Oct 26, 2020Updated 5 years ago
- ☆114Apr 19, 2024Updated 2 years ago
- ☆25Updated this week
- Implement Neural Networks in CUDA from Scratch☆24May 17, 2024Updated last year
- Local inference for Llama and Tongyi Qianwen (Qwen) large models, implemented with Apple Metal☆10Apr 26, 2024Updated 2 years ago
- Global Address SPace toolbox -- Julia wrapper☆10Nov 17, 2017Updated 8 years ago
- Repository for the code from the paper "ShaderPerFormer: Platform-independent Context-aware Shader Performance Predictor"☆12May 16, 2024Updated last year
- FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)☆67Apr 9, 2026Updated 3 weeks ago
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line☆26Updated this week
- 🎃 GPU load-balancing library for regular and irregular computations.☆66Sep 9, 2025Updated 7 months ago
- Implementation of the paper “Multiparameter modeling with ANN for antenna design”.☆15Apr 13, 2020Updated 6 years ago
- Automatically identifies keywords in text and bolds them.☆10Oct 30, 2024Updated last year
- An LLM-powered chatbot for the fediverse. A tech demo for BotKit.☆14Dec 20, 2025Updated 4 months ago
- A Lightweight Graph Processing Framework for Multi-GPUs☆14Apr 15, 2015Updated 11 years ago
- Range-based for loops to iterate over a range of numbers or values☆34Nov 23, 2016Updated 9 years ago