Apply GPU in ML and DL
☆68Mar 23, 2026Updated last month
Alternatives and similar repositories for GPU-in-ML-DL
Users that are interested in GPU-in-ML-DL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- RAPIDS Deployment Documentation☆15Apr 17, 2026Updated 2 weeks ago
- This repository documents my 100-day journey of learning and writing CUDA kernels.☆30Mar 29, 2026Updated last month
- Universal differential equations for ecologists☆14Apr 24, 2026Updated last week
- LockManager with deadlock detection for implementing 2PL☆13Mar 13, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Comparing Deep Learning Inference of Pytorch models running on CPU, CUDA and TensorRT☆16Feb 20, 2022Updated 4 years ago
- ☆91Feb 29, 2024Updated 2 years ago
- ☆475Dec 18, 2025Updated 4 months ago
- Parse objdump files using tree-sitter☆13Nov 22, 2023Updated 2 years ago
- A parser for PTX 6.5☆13Jun 19, 2023Updated 2 years ago
- Read custom dataset☆12Mar 31, 2023Updated 3 years ago
- ☆11Jun 9, 2023Updated 2 years ago
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant☆14Jan 9, 2023Updated 3 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- Finetuning BLOOM on a single GPU using gradient-accumulation☆32Mar 29, 2023Updated 3 years ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆450Feb 22, 2025Updated last year
- Vector Index Benchmark for Embeddings (VIBE) is an extensible benchmark for approximate nearest neighbor search methods, or vector index…☆37Mar 23, 2026Updated last month
- ☆95Nov 11, 2025Updated 5 months ago
- This repository will contain links to the most famous available books of ML that are online☆13Oct 15, 2024Updated last year
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.☆14Dec 3, 2021Updated 4 years ago
- Energy Consumption-Aware Tabular Benchmark For Neural Architecture Search☆11Aug 18, 2025Updated 8 months ago
- Flash Attention Triton kernel with support for second-order derivatives☆164Mar 10, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆34Updated this week
- A notebook testing CPU speed vs GPU speed with Pytorch and CUDA☆18Dec 25, 2021Updated 4 years ago
- Ecoacoustic analysis platform empowering conservationists to analyze acoustic data and to derive insights about the ecosystem at scale☆18Updated this week
- ☆10Feb 18, 2022Updated 4 years ago
- Dockerfile to create Homegear images☆10Sep 15, 2025Updated 7 months ago
- Local Action, Global Impact (Selected as Top 50 in the 2022 Solution Challenge.)☆17Jan 18, 2024Updated 2 years ago
- ☆10Nov 16, 2024Updated last year
- A simple Python script to convert FOA audio to binaural.☆16Nov 29, 2022Updated 3 years ago
- A Deep Learning-based Real-time Object Detector for DJI Drones☆12Oct 5, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Jan 20, 2025Updated last year
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆910Mar 29, 2025Updated last year
- My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.☆12Nov 12, 2022Updated 3 years ago
- Training framework for Large Behavioral Models☆28Sep 17, 2025Updated 7 months ago
- Minimal TPU implementation with 8x8 systolic array and PyTorch integration☆60Jan 26, 2026Updated 3 months ago
- A collection of GPU experiments and benchmarks for my personal understanding and research.☆30Apr 9, 2026Updated 3 weeks ago
- A Beginner's Guide to Monetizing Your Python AI Chatbot☆16Apr 22, 2025Updated last year