Apply GPU in ML and DL
☆62Mar 16, 2026Updated last week
Alternatives and similar repositories for GPU-in-ML-DL
Users that are interested in GPU-in-ML-DL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- NVIDIA tools guide☆164Jan 7, 2025Updated last year
- RAPIDS Deployment Documentation☆15Mar 11, 2026Updated 2 weeks ago
- This repository documents my 100-day journey of learning and writing CUDA kernels.☆27Jun 25, 2025Updated 9 months ago
- ☆15Feb 13, 2018Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- LockManager with deadlock detection for implementing 2PL☆13Mar 13, 2019Updated 7 years ago
- Comparing Deep Learning Inference of Pytorch models running on CPU, CUDA and TensorRT☆16Feb 20, 2022Updated 4 years ago
- ☆91Feb 29, 2024Updated 2 years ago
- ☆463Dec 18, 2025Updated 3 months ago
- Parse objdump files using tree-sitter☆13Nov 22, 2023Updated 2 years ago
- A parser for PTX 6.5☆13Jun 19, 2023Updated 2 years ago
- ☆28Jun 3, 2024Updated last year
- Read custom dataset☆12Mar 31, 2023Updated 2 years ago
- ☆11Jun 9, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆25Aug 29, 2025Updated 6 months ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 7 months ago
- Reimplementation of NeRF (Neural Radiance Fields) (ECCV2020)☆10May 4, 2023Updated 2 years ago
- ☆14Mar 7, 2025Updated last year
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- Finetuning BLOOM on a single GPU using gradient-accumulation☆31Mar 29, 2023Updated 2 years ago
- Blockchain Technologies - The course is divided into [6 modules]: 1. Distributed Systems & Consensus, 2. Cryptoeconomics & Proof-of-Stake…☆15Oct 21, 2025Updated 5 months ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆440Feb 22, 2025Updated last year
- Vector Index Benchmark for Embeddings (VIBE) is an extensible benchmark for approximate nearest neighbor search methods, or vector index…☆36Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆93Nov 11, 2025Updated 4 months ago
- A notebook testing CPU speed vs GPU speed with Pytorch and CUDA☆17Dec 25, 2021Updated 4 years ago
- Implementation from scratch in CUDA C++ of image processing algorithms.☆22Oct 26, 2020Updated 5 years ago
- This repository will contain links to the most famous available books of ML that are online☆12Oct 15, 2024Updated last year
- Flash Attention Triton kernel with support for second-order derivatives☆149Mar 10, 2026Updated 2 weeks ago
- ☆11Nov 6, 2019Updated 6 years ago
- ☆23Jul 11, 2025Updated 8 months ago
- Energy Consumption-Aware Tabular Benchmark For Neural Architecture Search☆11Aug 18, 2025Updated 7 months ago
- Implement Neural Networks in Cuda from Scratch☆24May 17, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Local Action, Global Impact (Selected as Top 50 in the 2022 Solution Challenge.)☆16Jan 18, 2024Updated 2 years ago
- A simple Python script to convert FOA audio to binaural.☆15Nov 29, 2022Updated 3 years ago
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆876Mar 29, 2025Updated 11 months ago
- A Deep Learning-based Real-time Object Detector for DJI Drones☆12Oct 5, 2018Updated 7 years ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Jan 20, 2025Updated last year
- My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.☆12Nov 12, 2022Updated 3 years ago
- Yet Another Finite Element Library☆10Sep 3, 2020Updated 5 years ago