idoheinemann / Assembly-Neural-NetworkLinks
A multy-layer feed-forward neural network implementation in assembly x86 32 bits
☆20Updated 6 years ago
Alternatives and similar repositories for Assembly-Neural-Network
Users that are interested in Assembly-Neural-Network are comparing it to the libraries listed below
Sorting:
- Code implementation from my blog post: https://fkodom.substack.com/p/transformers-from-scratch-in-pytorch☆97Updated 2 years ago
- Neural network from scratch in CUDA/C++☆88Updated 4 months ago
- Visualising Losses in Deep Neural Networks☆16Updated last year
- 11-785 Introduction to Deep Learning (IDeeL) website with logistics and select course materials☆72Updated this week
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- My own repository containing the codes I wrote to practice CUDA programming.☆65Updated 2 years ago
- Convolutional Neural Network implemented from Scratch for MNIST and CIFAR-10 datasets.☆66Updated 3 years ago
- Proximal Policy Optimization (PPO) algorithm for Sonic the Hedgehog☆27Updated 4 years ago
- Implements a LLM similar to Meta's Llama 2 from the ground up in PyTorch, for educational purposes.☆38Updated 11 months ago
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆54Updated last year
- ☆15Updated 7 months ago
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC☆14Updated 2 years ago
- a simple implementation of autograd engine☆24Updated 7 years ago
- Material for the Systems and Cognitive NeuroScience online course☆127Updated 9 months ago
- CUDA based GPU Programming☆39Updated last year
- Well documented, unit tested, type checked and formatted implementation of a vanilla transformer - for educational purposes.☆273Updated last year
- A Deep RL Wordle Bot☆12Updated 3 years ago
- Step by step explanation/tutorial of llama2.c☆225Updated 2 years ago
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆19Updated last year
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 3 years ago
- Tutorial for how to build BERT from scratch☆101Updated last year
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆202Updated 2 years ago
- Tutorials for Triton, a language for writing gpu kernels☆65Updated 2 years ago
- A really tiny autograd engine☆98Updated 7 months ago
- Implementation of various evolutionary algorithms, starting with evolutionary strategies☆44Updated last week
- Mixed precision training from scratch with Tensors and CUDA☆28Updated last year
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆119Updated 2 years ago
- Unofficial implementation of Tiny Recursive Model (TRM), improvement to HRM from Sapient AI, by Alexia Jolicoeur-Martineau☆167Updated 2 weeks ago
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆257Updated last year
- A codebase implementing a simple GPT-like model from scratch based on the Attention is All You Need paper.☆70Updated 2 years ago