flin3500 / Cuda-Google-Colab
The cuda code is mainly for nvidia hardware device. This repo will show how to run cuda c or cuda cpp code on the google colab platform for free.
☆24Updated last year
Alternatives and similar repositories for Cuda-Google-Colab:
Users that are interested in Cuda-Google-Colab are comparing it to the libraries listed below
- A simplified LLAMA implementation for training and inference tasks.☆30Updated 5 months ago
- AMD related optimizations for transformer models☆75Updated 5 months ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆179Updated last year
- Google TPU optimizations for transformers models☆108Updated 3 months ago
- A simple implementation of Llama 1, 2. Llama Architecture built from scratch using PyTorch all the models are built from scratch that inc…☆13Updated 11 months ago
- ☆17Updated last year
- ☆10Updated 3 years ago
- A minimal version of GPT-2 in 175 lines of PyTorch code.☆41Updated last week
- Learn CUDA with PyTorch☆20Updated 2 months ago
- Inference Llama 2 in one file of pure C☆28Updated last year
- Simple problems implemented in CUDA C☆19Updated 2 weeks ago
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆226Updated 7 months ago
- ⛰️ RockyML - A High-Performance Scientific Computing Framework for Non-smooth Machine Learning Problems☆19Updated 2 years ago
- Learning about CUDA by writing PTX code.☆128Updated last year
- GPT2 implementation in C++ using Ort☆26Updated 4 years ago
- NVIDIA tools guide☆129Updated 3 months ago
- LLM training in simple, raw C/CUDA☆92Updated 11 months ago
- Python bindings for ggml☆140Updated 7 months ago
- NNCG: A Neural Network Code Generator☆35Updated 8 months ago
- Visualising Losses in Deep Neural Networks☆16Updated 9 months ago
- ☆51Updated this week
- Collection of kernels written in Triton language☆120Updated 3 weeks ago
- Some CUDA example code with READMEs.☆94Updated last month
- Code implementation from my blog post: https://fkodom.substack.com/p/transformers-from-scratch-in-pytorch☆93Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆25Updated last year
- ML/DL Math and Method notes☆60Updated last year
- Custom kernels in Triton language for accelerating LLMs☆18Updated last year
- Can RL solve simple problems?☆54Updated last year
- Notes on "Programming Massively Parallel Processors" by Hwu, Kirk, and Hajj (4th ed.)☆52Updated 8 months ago
- Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…☆19Updated 7 months ago