muchlakshay / Dual-Backend-MLP-From-Scratch-CUDALinks
A fully from-scratch Multi-Layer Perceptron built in CUDA C++ with support for both GPU and CPU training. Includes multiple activation and loss functions, a clean and modular architecture, and an easy-to-use API, all without relying on external machine learning libraries.
☆19Updated 2 months ago
Alternatives and similar repositories for Dual-Backend-MLP-From-Scratch-CUDA
Users that are interested in Dual-Backend-MLP-From-Scratch-CUDA are comparing it to the libraries listed below
Sorting:
- 100 Days of GPU Challenge☆23Updated last month
- Repository to create traveling waves integrate special information through time☆55Updated 2 months ago
- ☆46Updated 6 months ago
- Generic MCP Client to use any MCP tool in a chat☆43Updated 5 months ago
- Building LLMs from scratch following the book from S. Raschka☆32Updated 6 months ago
- ☆160Updated 3 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 7 months ago
- An agent to generate stunning images :)☆23Updated 4 months ago
- ☆72Updated 3 months ago
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated last week
- ☆16Updated 4 months ago
- ☆26Updated last year
- alternative way to calculating self attention☆18Updated last year
- This Repository demostrates various examples using YOLO☆13Updated last year
- EdgeSAM model for use with Autodistill.☆29Updated last year
- ☆21Updated 11 months ago
- Enhancement in Multimodal Representation Learning.☆40Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆69Updated last year
- Hands-On Learning in Computer Vision☆26Updated this week
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆161Updated last month
- Eye exploration☆29Updated 8 months ago
- Code for paper "Analog Foundation Models"☆27Updated last month
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆37Updated 2 years ago
- Improving langchain knowledge graphs using baml☆33Updated 2 months ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆50Updated last year
- Fine tune Gemma 3 on an object detection task☆86Updated 3 months ago
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆25Updated last week
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1☆22Updated last week
- Coding an LLM and its building blocks from scratch.☆97Updated 6 months ago