Practice exercises and assessments for NVIDIA DLI's "Fundamentals of Accelerated Computing with CUDA Python" course.
☆30Sep 8, 2023Updated 2 years ago
Alternatives and similar repositories for Fundamentals_of_Accelerated_Computing_with_CUDA_Python
Users that are interested in Fundamentals_of_Accelerated_Computing_with_CUDA_Python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HLS project modeling various sparse accelerators.☆12Jan 11, 2022Updated 4 years ago
- ☆29Apr 7, 2025Updated last year
- Perceptron-based branch predictor written in C++☆13Dec 14, 2016Updated 9 years ago
- Pipelined 64-bit RISC-V core☆16Mar 7, 2024Updated 2 years ago
- ADC & LCD Interfacing using Verilog & VHDL☆12Feb 27, 2017Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Design and implementation of a reconfigurable FIR filter in FPGA☆15Sep 26, 2022Updated 3 years ago
- Lecture about FIR filter on an FPGA☆13May 15, 2024Updated last year
- ☆18Mar 12, 2025Updated last year
- ai_accelerator_basic_for_student (no solve)☆18Mar 27, 2020Updated 6 years ago
- Development repository for the Triton language and compiler☆23Sep 17, 2025Updated 7 months ago
- Swiss army knife glsl utility. Uses TOP textures for transforms, vertex based noise displacement with normal recalculation.☆15Jan 17, 2020Updated 6 years ago
- ComfyUI-VRAM-Manager is an independent memory management custom node for ComfyUI. Provides Distorch memory management functionality for e…☆29Mar 28, 2026Updated last month
- A simple component for collecting and move/copy all referenced files to the project directory.☆16Mar 25, 2020Updated 6 years ago
- MIPS multi cycle Verilog implementation based on Computer Organization and Design by David A. Patterson and John L. Hennessy☆23Jul 3, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Oct 5, 2025Updated 7 months ago
- ☆54Jan 9, 2024Updated 2 years ago
- A Heterogeneous GPU Platform for AI and Neural Graphics☆50Updated this week
- An experimental node☆45Oct 28, 2025Updated 6 months ago
- A repo of TouchDesigner style gude elements☆15Mar 11, 2021Updated 5 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆16May 28, 2025Updated 11 months ago
- FastSAM TouchDesigner Plugin – A TouchDesigner .tox plugin for real-time segmentation using FastSAM☆20Apr 17, 2025Updated last year
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech☆28Sep 14, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Python package for compressing floating-point PyTorch tensors☆13Jul 22, 2024Updated last year
- ☆48Mar 12, 2025Updated last year
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- 📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).☆76Apr 26, 2025Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 11 months ago
- Research on DeepSeek Sparse Attention☆40Oct 8, 2025Updated 6 months ago
- ☆12Jul 8, 2024Updated last year
- ☆16Nov 23, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search☆11Jun 21, 2023Updated 2 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- Implementation of Spectral State Space Models☆16Feb 23, 2024Updated 2 years ago
- A Benchmark for Multi-Stage Legal Case Documents Generation☆17Feb 24, 2025Updated last year
- Learn about image processing with an FPGA. Video lectures explain algorithm and implementation of lane detection for automotive driving. …☆46May 15, 2024Updated last year
- TouchDesigner Documentation MCP Server v2.6.1 - FIXED Python API tools! Features 629 operators + 14 tutorials + 69 Python API classes wit…☆54Feb 24, 2026Updated 2 months ago
- Companion code to the preprint: E Bıyık, K Wang, N Anari, D Sadigh, "Batch Active Learning using Determinantal Point Processes". arXiv pr…☆15Jul 25, 2024Updated last year