Practice exercises and assessments for NVIDIA DLI's "Fundamentals of Accelerated Computing with CUDA Python" course.
☆30 · Sep 8, 2023 · Updated 2 years ago
Alternatives and similar repositories for Fundamentals_of_Accelerated_Computing_with_CUDA_Python
Users who are interested in Fundamentals_of_Accelerated_Computing_with_CUDA_Python are comparing it to the repositories listed below.
- RTL implementation of a ray-tracing GPU ☆16 · Dec 18, 2012 · Updated 13 years ago
- HLS project modeling various sparse accelerators. ☆12 · Jan 11, 2022 · Updated 4 years ago
- ☆17 · Mar 8, 2025 · Updated last year
- Pipelined 64-bit RISC-V core ☆16 · Mar 7, 2024 · Updated 2 years ago
- ADC & LCD Interfacing using Verilog & VHDL ☆12 · Feb 27, 2017 · Updated 9 years ago
- Design and implementation of a reconfigurable FIR filter in FPGA ☆15 · Sep 26, 2022 · Updated 3 years ago
- Implementation of the pipelined RISC V processor with many useful features as fully bypassing, dynamic branch prediction, single and mult… ☆18 · Feb 12, 2024 · Updated 2 years ago
- ☆18 · Mar 12, 2025 · Updated last year
- This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision mod… ☆15 · Dec 31, 2024 · Updated last year
- ai_accelerator_basic_for_student (no solve) ☆18 · Mar 27, 2020 · Updated 6 years ago
- Development repository for the Triton language and compiler ☆25 · Sep 17, 2025 · Updated 8 months ago
- Open source RTL implementation of Tensor Core, Sparse Tensor Core, BitWave and SparSynergy in the article: "SparSynergy: Unlocking Flexib… ☆25 · Mar 29, 2025 · Updated last year
- ☆14 · Oct 5, 2025 · Updated 7 months ago
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch. ☆13 · Jun 19, 2017 · Updated 8 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc... ☆16 · May 28, 2025 · Updated last year
- A Heterogeneous GPU Platform for AI and Neural Graphics ☆56 · May 6, 2026 · Updated 3 weeks ago
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch ☆10 · Aug 7, 2024 · Updated last year
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech ☆28 · Sep 14, 2025 · Updated 8 months ago
- Python package for compressing floating-point PyTorch tensors ☆13 · Jul 22, 2024 · Updated last year
- The objective of this project was to design and implement a 5 stage pipeline CPU to support the RISC-V instruction architecture. This pip… ☆29 · Oct 31, 2021 · Updated 4 years ago
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023) ☆14 · Nov 25, 2024 · Updated last year
- FireSim-NVDLA: NVIDIA Deep Learning Accelerator (NVDLA) Integrated with RISC-V Rocket Chip SoC Running on the Amazon FPGA Cloud ☆36 · Sep 30, 2019 · Updated 6 years ago
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- Cluster-level matrix unit integration into GPUs, implemented in Chipyard SoC ☆56 · Jan 20, 2026 · Updated 4 months ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs ☆16 · Dec 10, 2024 · Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆19 · May 23, 2025 · Updated last year
- Learning to Skip the Middle Layers of Transformers ☆17 · Aug 7, 2025 · Updated 9 months ago
- ☆12 · Jul 8, 2024 · Updated last year
- Benchmarks used in the gpgpu-sim ispass 2009 paper ☆31 · May 7, 2015 · Updated 11 years ago
- ☆17 · Nov 23, 2023 · Updated 2 years ago
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search ☆11 · Jun 21, 2023 · Updated 2 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024) ☆12 · Oct 31, 2024 · Updated last year
- Predicting the Stock Market - Can we do it? ☆10 · Jul 24, 2021 · Updated 4 years ago
- Project repo for the paper SILT: Self-supervised Lighting Transfer Using Implicit Image Decomposition ☆10 · Dec 17, 2021 · Updated 4 years ago
- Implementation of Spectral State Space Models ☆16 · Feb 23, 2024 · Updated 2 years ago
- A Benchmark for Multi-Stage Legal Case Documents Generation ☆18 · Feb 24, 2025 · Updated last year
- Learn about image processing with an FPGA. Video lectures explain algorithm and implementation of lane detection for automotive driving. … ☆46 · May 15, 2024 · Updated 2 years ago
- Companion code to the preprint: E Bıyık, K Wang, N Anari, D Sadigh, "Batch Active Learning using Determinantal Point Processes". arXiv pr… ☆15 · Jul 25, 2024 · Updated last year
- Code to go along with my AI agents youtube video ☆17 · Apr 5, 2024 · Updated 2 years ago