Practice exercises and assessments for NVIDIA DLI's "Fundamentals of Accelerated Computing with CUDA Python" course.
☆30 · Sep 8, 2023 · Updated 2 years ago
Alternatives and similar repositories for Fundamentals_of_Accelerated_Computing_with_CUDA_Python
Users who are interested in Fundamentals_of_Accelerated_Computing_with_CUDA_Python are comparing it to the repositories listed below.
- RTL implementation of a ray-tracing GPU ☆15 · Dec 18, 2012 · Updated 13 years ago
- HLS project modeling various sparse accelerators. ☆12 · Jan 11, 2022 · Updated 4 years ago
- ☆29 · Apr 7, 2025 · Updated last year
- ADC & LCD Interfacing using Verilog & VHDL ☆12 · Feb 27, 2017 · Updated 9 years ago
- Design and implementation of a reconfigurable FIR filter in FPGA ☆15 · Sep 26, 2022 · Updated 3 years ago
- Implementation of the pipelined RISC V processor with many useful features as fully bypassing, dynamic branch prediction, single and mult… ☆18 · Feb 12, 2024 · Updated 2 years ago
- ☆18 · Mar 12, 2025 · Updated last year
- Programming and Assignment Material for ECE 695 ☆17 · Apr 23, 2021 · Updated 4 years ago
- Open source RTL implementation of Tensor Core, Sparse Tensor Core, BitWave and SparSynergy in the article: "SparSynergy: Unlocking Flexib… ☆23 · Mar 29, 2025 · Updated last year
- MIPS multi cycle Verilog implementation based on Computer Organization and Design by David A. Patterson and John L. Hennessy ☆23 · Jul 3, 2020 · Updated 5 years ago
- ☆13 · Oct 5, 2025 · Updated 6 months ago
- ☆19 · Jul 8, 2024 · Updated last year
- A Heterogeneous GPU Platform for Chipyard SoC ☆50 · Apr 3, 2026 · Updated 2 weeks ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch. ☆13 · Jun 19, 2017 · Updated 8 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc... ☆15 · May 28, 2025 · Updated 10 months ago
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch ☆10 · Aug 7, 2024 · Updated last year
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech ☆28 · Sep 14, 2025 · Updated 7 months ago
- Python package for compressing floating-point PyTorch tensors ☆13 · Jul 22, 2024 · Updated last year
- The objective of this project was to design and implement a 5 stage pipeline CPU to support the RISC-V instruction architecture. This pip… ☆28 · Oct 31, 2021 · Updated 4 years ago
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023) ☆14 · Nov 25, 2024 · Updated last year
- FireSim-NVDLA: NVIDIA Deep Learning Accelerator (NVDLA) Integrated with RISC-V Rocket Chip SoC Running on the Amazon FPGA Cloud ☆36 · Sep 30, 2019 · Updated 6 years ago
- Cluster-level matrix unit integration into GPUs, implemented in Chipyard SoC ☆53 · Jan 20, 2026 · Updated 2 months ago
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- A simple command line tool for sending and receiving Open Sound Control (OSC) commands. Suitable for testing as well as scripting purpose… ☆27 · Mar 17, 2026 · Updated 3 weeks ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs ☆16 · Dec 10, 2024 · Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆18 · May 23, 2025 · Updated 10 months ago
- Gaussian Splatting for OpenFrameworks by Zach Lieberman and Char Stiles ☆26 · Aug 3, 2024 · Updated last year
- Learning to Skip the Middle Layers of Transformers ☆17 · Aug 7, 2025 · Updated 8 months ago
- ☆12 · Jul 8, 2024 · Updated last year
- Benchmarks used in the gpgpu-sim ispass 2009 paper ☆31 · May 7, 2015 · Updated 10 years ago
- Realtime video processing w/ Gaussian + Sobel Filters targeting Artix-7 FPGA ☆34 · Nov 8, 2021 · Updated 4 years ago
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search ☆11 · Jun 21, 2023 · Updated 2 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024) ☆12 · Oct 31, 2024 · Updated last year
- Predicting the Stock Market - Can we do it? ☆10 · Jul 24, 2021 · Updated 4 years ago
- Project repo for the paper SILT: Self-supervised Lighting Transfer Using Implicit Image Decomposition ☆10 · Dec 17, 2021 · Updated 4 years ago
- Implementation of Spectral State Space Models ☆16 · Feb 23, 2024 · Updated 2 years ago
- A Benchmark for Multi-Stage Legal Case Documents Generation ☆16 · Feb 24, 2025 · Updated last year
- ☆48 · May 24, 2023 · Updated 2 years ago
- ☆14 · Apr 11, 2017 · Updated 9 years ago