A plugin for Jupyter Notebook to run CUDA C/C++ code
☆259Sep 13, 2024Updated last year
Alternatives and similar repositories for nvcc4jupyter
Users that are interested in nvcc4jupyter are comparing it to the libraries listed below
Sorting:
- Experiment of using Tangent to autodiff triton☆82Jan 22, 2024Updated 2 years ago
- GPU programming related news and material links☆2,047Mar 8, 2026Updated last week
- Learn CUDA with PyTorch☆253Mar 14, 2026Updated last week
- Cuda extensions for PyTorch☆12Dec 2, 2025Updated 3 months ago
- Blog post☆17Feb 16, 2024Updated 2 years ago
- ☆176Feb 3, 2024Updated 2 years ago
- ☆19Dec 4, 2025Updated 3 months ago
- This is an advanced tutorial to OpenACC and OpenMP.☆15Feb 17, 2022Updated 4 years ago
- The repository contains container recipes to build the entire stack of Xeus-Cling and Cling including cuda extension with just a few comm…☆10Dec 22, 2020Updated 5 years ago
- CUDA Learning guide☆540Jun 20, 2024Updated last year
- Display images in the terminal☆20Mar 4, 2024Updated 2 years ago
- ☆22Dec 15, 2023Updated 2 years ago
- ☆21Mar 3, 2025Updated last year
- Personal configuration☆13Feb 27, 2026Updated 3 weeks ago
- Optimized Parallel Tiled Approach to perform 2D Convolution by taking advantage of the lower latency, higher bandwidth shared memory as w…☆15Oct 17, 2017Updated 8 years ago
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆61Feb 21, 2022Updated 4 years ago
- Log☆11Nov 8, 2021Updated 4 years ago
- Fork of the Blaze library for compatibility with Blaze CUDA · https://bitbucket.org/blaze-lib/blaze · https://github.com/STEllAR-GROUP/bl…☆10Oct 17, 2019Updated 6 years ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- Material for gpu-mode lectures☆5,841Feb 1, 2026Updated last month
- Pollard, kangaroo method, based on engine VanitySearch1.15☆12Oct 9, 2019Updated 6 years ago
- ☆17Aug 30, 2022Updated 3 years ago
- A White-Box Masking Scheme Against Computational and Algebraic Attacks☆13Jan 6, 2021Updated 5 years ago
- Bagua tutorials.☆13Sep 4, 2022Updated 3 years ago
- CUDA Templates and Python DSLs for High-Performance Linear Algebra☆9,442Updated this week
- Source-to-Source Debuggable Derivatives in Pure Python☆15Jan 23, 2024Updated 2 years ago
- Utilities for PyTorch distributed☆25Feb 27, 2025Updated last year
- Code for ICML 2025 paper | Joint Localization and Activation Editing for Low-Resource Fine-Tuning☆27Jun 18, 2025Updated 9 months ago
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 3 months ago
- ☆132Updated this week
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…☆14May 4, 2024Updated last year
- A white-box Speck implementation using self-equivalence encodings☆13Jun 25, 2022Updated 3 years ago
- GPTQ inference TVM kernel☆40Apr 25, 2024Updated last year
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit☆8,953Jan 6, 2026Updated 2 months ago
- Landing page and repository for the 'Active Agents' tutorial held 17 July, 2024 at the 10th International Conference on Computational Soc…☆22Feb 8, 2025Updated last year
- Flexibly track outputs and grad-outputs of torch.nn.Module.☆13Oct 6, 2023Updated 2 years ago
- Hacks for PyTorch☆19Apr 18, 2023Updated 2 years ago
- Utilities for Training Very Large Models☆58Sep 25, 2024Updated last year
- ☆12Jan 13, 2025Updated last year