pytorch from scratch in pure C/CUDA and python
☆41Oct 10, 2024Updated last year
Alternatives and similar repositories for lilgrad
Users that are interested in lilgrad are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A universal thread-safe memory pool.☆26Jul 20, 2018Updated 7 years ago
- This repository contains a C implementation of matrix multiplication with various optimization techniques.☆15Jun 7, 2025Updated last year
- Tic-Tac-Toe game in C☆15Feb 12, 2025Updated last year
- ☆21Oct 9, 2024Updated last year
- a simple numpy alternative in C☆31Sep 26, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Real-Time RTUs☆12Mar 20, 2026Updated 2 months ago
- C++20 Memory Allocator library☆37Apr 30, 2025Updated last year
- ☆12Sep 25, 2024Updated last year
- Profile your CoreML models directly from Python 🐍☆30Sep 8, 2025Updated 9 months ago
- simply neural networks in every language☆42Nov 24, 2025Updated 6 months ago
- simple RISC-V 64bit emulator, which can boot linux kernel.☆12Oct 16, 2023Updated 2 years ago
- A MCDR plugin for post items☆10Aug 25, 2025Updated 9 months ago
- Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)☆12Jun 20, 2025Updated 11 months ago
- ☆1,281Oct 4, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A guide that explains how programs transform from source code to executables. Deep dive into ELF format, linking processes, and binary op…☆366Jul 20, 2025Updated 10 months ago
- A collection of user interface widgets in Python for use in programs using Pygame.☆17Aug 19, 2024Updated last year
- A tool allowing students of Coursera's Heterogeneous Parallel Programming to work on homework using a machine without a CUDA GPU.☆11Mar 11, 2015Updated 11 years ago
- A c/RISCV of "Let's Build a Compiler" by Jack Crenshaw☆124Sep 26, 2022Updated 3 years ago
- Reference implementation of Mistral AI 7B v0.1 model.☆28Dec 25, 2023Updated 2 years ago
- Milkv-duo Cross Compile software☆10Jun 8, 2023Updated 3 years ago
- Dictionary, like stl map, in C, as a single-file header, with convenience functionality for strings, ints, and indexing.☆23Dec 29, 2025Updated 5 months ago
- Mixed precision training from scratch with Tensors and CUDA☆30May 14, 2024Updated 2 years ago
- Luogu plugin on IntelliJ Platform☆10Jan 6, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A tinycompiler in C from scratch☆107Aug 3, 2024Updated last year
- A PyTorch Implementation of Neural Turing Machine☆14Jul 24, 2020Updated 5 years ago
- A zero-dependency ML framework in C with a modern Python API for full control over execution and memory.☆688Updated this week
- Exercises for Learning MLIR (Originally written for PPoPP 2026)☆104Feb 5, 2026Updated 4 months ago
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆17Dec 24, 2017Updated 8 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 4 months ago
- 小彭 老师推出 SyCL 2020 课程(施工中,日后会在直播中放出)☆15Sep 3, 2023Updated 2 years ago
- Open-Source Competitive Programming Atlas of Algorithms and Data Structures☆12Feb 21, 2025Updated last year
- Optimized parallel training implementation of a neural network in C for recognizing handwritten digits from scratch on the MNIST dataset☆87Aug 20, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Educational collection of LLVM obfuscation passes. (Feel free to use it for your course)☆35Apr 22, 2026Updated last month
- A C++ memory context☆11Jul 28, 2021Updated 4 years ago
- A basic numpy like library for micropython☆18Feb 11, 2020Updated 6 years ago
- llama3.cuda is a pure C/CUDA implementation for Llama 3 model.☆350Apr 27, 2025Updated last year
- Contains Deep Learning Code implemented on generic CPUs & Intel Xeon Phi Coprocessors☆12May 2, 2016Updated 10 years ago
- An open-source multi-purpose smoothed particle hydrodynamics (SPH) / N-body hybrid code☆10Dec 6, 2017Updated 8 years ago
- One File Tensor Libraries☆31Oct 7, 2025Updated 8 months ago