muchlakshay / Dual-Backend-MLP-From-Scratch-CUDALinks

A fully from-scratch Multi-Layer Perceptron built in CUDA C++ with support for both GPU and CPU training. Includes multiple activation and loss functions, a clean and modular architecture, and an easy-to-use API, all without relying on external machine learning libraries.

☆19

Alternatives and similar repositories for Dual-Backend-MLP-From-Scratch-CUDA

Users that are interested in Dual-Backend-MLP-From-Scratch-CUDA are comparing it to the libraries listed below

Sorting:

Sayandip170900 / CUDA-Challenge
100 Days of GPU Challenge
☆24Updated last month
EPFL-VILAB / fm-vision-evals
☆72Updated 5 months ago
zlab-princeton / llm-pruning-collection
A collection of various llm pruning implementations, training code for GPUs & TPUs, and evaluation script.
☆45Updated 3 weeks ago
antar-ai / yolo-examples
This Repository demostrates various examples using YOLO
☆13Updated last year
yifanzhang-pro / deep-delta-learning
Official Project Page for Deep Delta Learning (https://huggingface.co/papers/2601.00417)
☆282Updated this week
Lossfunk / KernelBench-v2
KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems
☆21Updated 6 months ago
run-llama / image-generation-agent
An agent to generate stunning images :)
☆23Updated 7 months ago
IBM / analog-foundation-models
Code for paper "Analog Foundation Models"
☆27Updated 3 months ago
autodistill / autodistill-grounded-edgesam
EdgeSAM model for use with Autodistill.
☆29Updated last year
foundation-model-stack / bamba
Train, tune, and infer Bamba model
☆137Updated 7 months ago
kmohan321 / Research_Papers
☆46Updated 9 months ago
uq-project / UQ
UQ: Assessing Language Models on Unsolved Questions
☆29Updated 4 months ago
RobinGerster7 / OSSA
☆26Updated last year
ZihanWang314 / coeCheck
☆19Updated 10 months ago
KempnerInstitute / traveling-waves-integrate
Repository to create traveling waves integrate special information through time
☆56Updated 5 months ago
MaxBelitsky / cache-steering
KV Cache Steering for Inducing Reasoning in Small Language Models
☆44Updated 5 months ago
XiaoduoAILab / XmodelLM
XmodelLM
☆38Updated last year
allenai / bolmo-core
Code for Bolmo: Byteifying the Next Generation of Language Models
☆112Updated 2 weeks ago
lucidrains / tiny-recursive-model
Unofficial implementation of Tiny Recursive Model (TRM), improvement to HRM from Sapient AI, by Alexia Jolicoeur-Martineau
☆167Updated 2 weeks ago
mmhamdy / open-language-models
A list of language models with permissive licenses such as MIT or Apache 2.0
☆24Updated 10 months ago
galilai-group / llm-jepa
☆154Updated 3 months ago
huggingface / large-scale-image-deduplication
☆182Updated 5 months ago
zlab-princeton / Derf
Official Implementation of Dynamic erf (Derf).
☆100Updated last month
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆47Updated 3 months ago
ashishpatel26 / ai-tutor-rag-system
This is a repository for the course "From Beginner to LLM Developer" by Towards AI.
☆12Updated last year
NVIDIA-NeMo / Nemotron
Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, and full end-to-end reference exampl…
☆314Updated this week
joey00072 / Attention-as-graph
alternative way to calculating self attention
☆18Updated last year
dusty-nv / NanoDB
Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP
☆64Updated 8 months ago
ThinamXx / cuda-mode
Making of cuda kernel
☆17Updated 7 months ago
harishsg993010 / HawkinsRAG
☆20Updated 10 months ago