akhuntsaria / canny-edge-detection
Canny edge detector implemented in CUDA C/C++
☆26Updated last month
Alternatives and similar repositories for canny-edge-detection:
Users that are interested in canny-edge-detection are comparing it to the libraries listed below
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆214Updated 3 months ago
- Learnings and programs related to CUDA☆373Updated last month
- Alex Krizhevsky's original code from Google Code☆191Updated 9 years ago
- machine learning from absolute scratch in c. gradients, linear algebra ops & everything else without using any third party library!☆22Updated 8 months ago
- This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mast…☆308Updated last month
- ☆40Updated 3 weeks ago
- Learning about CUDA by writing PTX code.☆125Updated last year
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆251Updated 4 months ago
- pytorch from scratch in pure C/CUDA and python☆40Updated 5 months ago
- Tensor library with autograd using only Rust's standard library☆67Updated 9 months ago
- Setting up Vscode to work with Pytorch in C/C++ with CUDA support☆25Updated last month
- ☆212Updated last week
- From zero to hero CUDA for accelerating maths and machine learning on GPU.☆181Updated last week
- Question paper of courses taught at IISC as part of MTech AI curriculum☆58Updated 4 months ago
- GPU Kernels☆157Updated this week
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆170Updated 8 months ago
- 100 days of building GPU kernels!☆321Updated this week
- ☆46Updated this week
- The Tensor (or Array)☆427Updated 7 months ago
- ☆18Updated 2 weeks ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆169Updated last week
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆62Updated last week
- A deep dive on the history of robotics and the future of humanoids☆70Updated 3 months ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆343Updated last month
- A 120-day CUDA learning plan covering daily concepts, exercises, pitfalls, and references (including “Programming Massively Parallel Proc…☆633Updated this week
- Mapping ping with a simple script and Ordinary Kriging to interpolate sparse measurements into a nice visualization!☆80Updated 5 months ago
- ☆209Updated last week
- parallelized hyperdimensional tictactoe☆118Updated 7 months ago
- Accelerated General (FP32) Matrix Multiplication from scratch in CUDA☆111Updated 2 months ago
- High Quality Resources on GPU Programming/Architecture☆584Updated 8 months ago