muchlakshay / Dual-Backend-MLP-From-Scratch-CUDALinks
A fully from-scratch Multi-Layer Perceptron built in CUDA C++ with support for both GPU and CPU training. Includes multiple activation and loss functions, a clean and modular architecture, and an easy-to-use API, all without relying on external machine learning libraries.
☆19Updated 2 months ago
Alternatives and similar repositories for Dual-Backend-MLP-From-Scratch-CUDA
Users that are interested in Dual-Backend-MLP-From-Scratch-CUDA are comparing it to the libraries listed below
Sorting:
- 100 Days of GPU Challenge☆24Updated last month
- ☆72Updated 5 months ago
- A collection of various llm pruning implementations, training code for GPUs & TPUs, and evaluation script.☆45Updated 3 weeks ago
- This Repository demostrates various examples using YOLO☆13Updated last year
- Official Project Page for Deep Delta Learning (https://huggingface.co/papers/2601.00417)☆282Updated this week
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems☆21Updated 6 months ago
- An agent to generate stunning images :)☆23Updated 7 months ago
- Code for paper "Analog Foundation Models"☆27Updated 3 months ago
- EdgeSAM model for use with Autodistill.☆29Updated last year
- Train, tune, and infer Bamba model☆137Updated 7 months ago
- ☆46Updated 9 months ago
- UQ: Assessing Language Models on Unsolved Questions☆29Updated 4 months ago
- ☆26Updated last year
- ☆19Updated 10 months ago
- Repository to create traveling waves integrate special information through time☆56Updated 5 months ago
- KV Cache Steering for Inducing Reasoning in Small Language Models☆44Updated 5 months ago
- XmodelLM☆38Updated last year
- Code for Bolmo: Byteifying the Next Generation of Language Models☆112Updated 2 weeks ago
- Unofficial implementation of Tiny Recursive Model (TRM), improvement to HRM from Sapient AI, by Alexia Jolicoeur-Martineau☆167Updated 2 weeks ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 10 months ago
- ☆154Updated 3 months ago
- ☆182Updated 5 months ago
- Official Implementation of Dynamic erf (Derf).☆100Updated last month
- Resa: Transparent Reasoning Models via SAEs☆47Updated 3 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated last year
- Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, and full end-to-end reference exampl…☆314Updated this week
- alternative way to calculating self attention☆18Updated last year
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆64Updated 8 months ago
- Making of cuda kernel☆17Updated 7 months ago
- ☆20Updated 10 months ago