pytorch from scratch in pure C/CUDA and python
☆41Oct 10, 2024Updated last year
Alternatives and similar repositories for lilgrad
Users that are interested in lilgrad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- creating a tiny tensor library in raw C☆1,455Mar 5, 2025Updated last year
- Andrej Kapathy's micrograd implemented in c☆30Aug 7, 2024Updated last year
- A universal thread-safe memory pool.☆26Jul 20, 2018Updated 7 years ago
- This repository contains a C implementation of matrix multiplication with various optimization techniques.☆15Jun 7, 2025Updated 11 months ago
- Tic-Tac-Toe game in C☆15Feb 12, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)☆13Jun 11, 2025Updated 11 months ago
- ☆12Sep 25, 2024Updated last year
- Neural network in C for recognizing american sign language(ASL) from scratch on the MNIST dataset. Optimized with parallel training. Cann…☆38Aug 26, 2024Updated last year
- simply neural networks in every language☆41Nov 24, 2025Updated 6 months ago
- This repository demonstrates the application of our proposed task-free continual learning method on a synthetic experiment.☆13Jun 24, 2019Updated 6 years ago
- A guide that explains how programs transform from source code to executables. Deep dive into ELF format, linking processes, and binary op…☆365Jul 20, 2025Updated 10 months ago
- A tool allowing students of Coursera's Heterogeneous Parallel Programming to work on homework using a machine without a CUDA GPU.☆11Mar 11, 2015Updated 11 years ago
- Reference implementation of Mistral AI 7B v0.1 model.☆28Dec 25, 2023Updated 2 years ago
- Mixed precision training from scratch with Tensors and CUDA☆30May 14, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A PyTorch Implementation of Neural Turing Machine☆14Jul 24, 2020Updated 5 years ago
- A zero-dependency ML framework in C with a modern Python API for full control over execution and memory.☆685May 20, 2026Updated last week
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 3 months ago
- 小彭老师推出 SyCL 2020 课程(施工中,日后会在直播中放出)☆15Sep 3, 2023Updated 2 years ago
- Optimized parallel training implementation of a neural network in C for recognizing handwritten digits from scratch on the MNIST dataset☆87Aug 20, 2024Updated last year
- A C++ memory context☆11Jul 28, 2021Updated 4 years ago
- A basic numpy like library for micropython☆18Feb 11, 2020Updated 6 years ago
- GEMM☆10Aug 26, 2023Updated 2 years ago
- llama3.cuda is a pure C/CUDA implementation for Llama 3 model.☆350Apr 27, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A JIT compiler implemented with MLIR/LLVM for faster query processing in SQLite☆20Jan 3, 2023Updated 3 years ago
- One File Tensor Libraries☆31Oct 7, 2025Updated 7 months ago
- A C compiler, written in Rust.☆10Feb 13, 2022Updated 4 years ago
- C++ pipeline with OpenVINO native API for Stable Diffusion v1.5☆13Feb 23, 2024Updated 2 years ago
- Agar.io for Continual Reinforcement Learning☆24Jul 24, 2025Updated 10 months ago
- JAX implementations of RWKV☆18Sep 26, 2023Updated 2 years ago
- ☆27Apr 13, 2025Updated last year
- C++17 Neural Network (NN), Convolutional Neural Network (CNN) and Deep Learning for Esp32 on IDF from scratch☆24Aug 23, 2023Updated 2 years ago
- ☆11May 16, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- parse the LHCb nightlies compile_commands.json and create compiler-explorer c++.properties path☆13Dec 31, 2021Updated 4 years ago
- Work-in-progress TAS tools for A Hat in Time☆11Aug 3, 2019Updated 6 years ago
- Smart contracts for a home rental network with IoT doorlocks☆11Jun 5, 2018Updated 7 years ago
- Implementation of the "Online learning of long-range dependencies" paper, NeurIPS 2023☆21Nov 4, 2024Updated last year
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆30May 20, 2026Updated last week
- An implementation of the paper "Overcoming catastrophic forgetting in neural networks" (DeepMind, 2016), using Pytorch framework.☆16Sep 9, 2018Updated 7 years ago