Google Colab Notebooks for Udacity CS344 - Intro to Parallel Programming
☆136Apr 14, 2021Updated 4 years ago
Alternatives and similar repositories for udacity-cs344-colab
Users that are interested in udacity-cs344-colab are comparing it to the libraries listed below
Sorting:
- CS344 - Introduction To Parallel Programming course (Udacity) proposed solutions☆54Jul 23, 2017Updated 8 years ago
- Fibertree emulator☆17Nov 4, 2024Updated last year
- ☆15Apr 18, 2023Updated 2 years ago
- Implementation for MIT 6.824 Distributed System☆14Jul 18, 2014Updated 11 years ago
- A mini-app to solve the heat conduction equation☆15Jul 1, 2020Updated 5 years ago
- Canvas: End-to-End Kernel Architecture Search in Neural Networks☆27Nov 18, 2024Updated last year
- Using TVM to depoly Transformer on CPU and GPU☆11Aug 25, 2021Updated 4 years ago
- Code for SIGGRAPH 2025 conference paper "Automated Task Scheduling for Cloth and Deformable Body Simulations in Heterogeneous Computing E…☆26Nov 22, 2025Updated 4 months ago
- Parallel programming tutorials☆637Mar 28, 2021Updated 4 years ago
- a simple physically based rendering renderer.☆11Jun 2, 2021Updated 4 years ago
- A C++ implementation of Yolov5 helmet detection in Jetson Xavier nx and Jetson nano☆22Nov 11, 2021Updated 4 years ago
- ☆12Nov 5, 2022Updated 3 years ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆84Mar 20, 2023Updated 3 years ago
- Homework of CMU 10-414/714: Deep Learning Systems (https://dlsyscourse.org/)☆15Mar 21, 2024Updated 2 years ago
- 深度学习模型部署基础☆50May 18, 2021Updated 4 years ago
- PTX-EMU is a simple emulator for CUDA program.☆38Apr 25, 2025Updated 10 months ago
- ☆36Apr 10, 2024Updated last year
- Numerical Experiments☆15Jan 21, 2018Updated 8 years ago
- ☆2,714Jan 16, 2024Updated 2 years ago
- ☆82Mar 4, 2022Updated 4 years ago
- Assembler and Decompiler for NVIDIA (Maxwell Pascal Volta Turing Ampere) GPUs.☆94Feb 23, 2023Updated 3 years ago
- Disparity Maps and Image Segmentation Implementation☆10Jul 16, 2018Updated 7 years ago
- Laser Plasma Interaction Cheat-Sheet☆11Apr 2, 2025Updated 11 months ago
- Code Release for NeurIPS 2025, "COS3D: Collaborative Open-Vocabulary 3D Segmentation"☆16Dec 21, 2025Updated 3 months ago
- ☆22Sep 18, 2019Updated 6 years ago
- An OpenRISC 1000 multi-core virtual platform based on SystemC/TLM☆16Mar 25, 2025Updated 11 months ago
- Calculate texture mipmap level in shader.☆23Jun 22, 2019Updated 6 years ago
- 📓 A LaTeX template for writing thesis report for RUET☆12Jan 7, 2016Updated 10 years ago
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading☆13Jun 28, 2025Updated 8 months ago
- Cryptography accelerator ASIC (for AES128/AES256 and SHA256) using Skywater 130nm process node (build-environment repo).☆11Jan 13, 2021Updated 5 years ago
- My solution code to parallel architecture and programming Spring 2016☆12Aug 15, 2016Updated 9 years ago
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆25May 12, 2025Updated 10 months ago
- [AAAI 2024] SiMA-Hand: Boosting 3D Hand-Mesh Reconstruction by Single-to-Multi-view Adaptation, Pytorch implementation.☆11Feb 6, 2024Updated 2 years ago
- ☆10Dec 3, 2020Updated 5 years ago
- compiler learning resources collect.☆2,693Mar 19, 2025Updated last year
- ☆22Updated this week
- Convert C files into Verilog☆21Jan 27, 2019Updated 7 years ago
- let coding agents use ncu skills analysis cuda program automatically!☆61Feb 5, 2026Updated last month
- This is the final project for the BP NN FPGA implementation☆11Jan 14, 2017Updated 9 years ago