flin3500 / Cuda-Google-Colab
The cuda code is mainly for nvidia hardware device. This repo will show how to run cuda c or cuda cpp code on the google colab platform for free.
☆24Updated last year
Alternatives and similar repositories for Cuda-Google-Colab:
Users that are interested in Cuda-Google-Colab are comparing it to the libraries listed below
- asynchronous/distributed speculative evaluation for llama3☆39Updated 7 months ago
- A simplified LLAMA implementation for training and inference tasks.☆30Updated 4 months ago
- Google TPU optimizations for transformers models☆104Updated 2 months ago
- Evaluate Transformers from the Hub 🔥☆13Updated last year
- High-Performance SGEMM on CUDA devices☆87Updated 2 months ago
- Learning about CUDA by writing PTX code.☆125Updated last year
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆217Updated 6 months ago
- GPU documentation for humans☆32Updated this week
- Visualize ONNX models with model-explorer☆31Updated 3 weeks ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆177Updated last year
- ☆10Updated 3 years ago
- ☆47Updated this week
- Python bindings for ggml☆140Updated 6 months ago
- Random number library that generate pseudo-random and quasi-random numbers.☆26Updated this week
- ⛰️ RockyML - A High-Performance Scientific Computing Framework for Non-smooth Machine Learning Problems☆19Updated last year
- Neural search engine for discovering semantically similar Python repositories on GitHub☆27Updated last year
- Count GitHub Stars ⭐☆29Updated this week
- ML/DL Math and Method notes☆59Updated last year
- 👷 Build compute kernels☆24Updated this week
- ☆17Updated last year
- minimal C implementation of speculative decoding based on llama2.c☆20Updated 8 months ago
- Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…☆19Updated 6 months ago
- Inference Llama 2 in one file of pure C☆28Updated last year
- A minimal version of GPT-2 in 175 lines of PyTorch code.☆40Updated this week
- 3X speedup over Apple’s TensorFlow plugin by using Apache TVM on M1☆136Updated 2 years ago
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆28Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆181Updated this week
- ☆63Updated 10 months ago
- ☆10Updated 2 years ago
- [WIP] A 🔥 interface for running code in the cloud☆86Updated 2 years ago