Assignment 1 for the CMU 15418 Course
☆25Aug 7, 2020Updated 5 years ago
Alternatives and similar repositories for assignment1
Users that are interested in assignment1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stanford CS149 -- Assignment 1☆20Jul 24, 2021Updated 4 years ago
- cutile kernel examples☆50Apr 3, 2026Updated 2 months ago
- Calculate the groundstate energy of 1D and 2D Fermi-Hubbard model on the GPU with Cuda.☆14Aug 30, 2017Updated 8 years ago
- TinyML and Efficient Deep Learning Computing☆20Apr 26, 2024Updated 2 years ago
- jump to a place when progam runs to the max instruction number☆16Dec 14, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆19Apr 6, 2024Updated 2 years ago
- An x86 Intel processor information gatherer☆14Jan 13, 2018Updated 8 years ago
- ☆12Aug 21, 2019Updated 6 years ago
- ☆13Oct 18, 2016Updated 9 years ago
- Langchain Agent finetuning using 7B - LLAMA 2 , on hotpotQA (Retroformer framework)☆16Sep 5, 2023Updated 2 years ago
- HPC-Lab for High Performance Computing course, 2023 Spring , Tsinghua Universit. 高性能计算导论 @ THU.☆25Jun 13, 2023Updated 3 years ago
- CSAPP exercises answers and labs☆20Aug 16, 2021Updated 4 years ago
- ☆11Apr 16, 2022Updated 4 years ago
- Papers about infrastructure (deployment & serving) and systems for compound AI☆13Nov 6, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- DGEMM on KNL, achieve 75% MKL☆19May 19, 2022Updated 4 years ago
- ☆23Apr 16, 2020Updated 6 years ago
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆138Sep 25, 2023Updated 2 years ago
- The PackNet Continual Learning Method in Pytorch☆15Aug 19, 2021Updated 4 years ago
- 深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,近30万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系sc…☆10Jan 31, 2019Updated 7 years ago
- ☆30Sep 4, 2023Updated 2 years ago
- CUDA Tensor Transpose (cuTT) library☆55Aug 10, 2017Updated 8 years ago
- This is the course taught by Prof.John Shen and Prof. Onur Mutlu from CMU☆11May 13, 2016Updated 10 years ago
- Heterogeneous Model Reuse via Optimizing Multiparty Multiclass Margin☆11Jan 15, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- CMU 15-745 Spring 2014☆10Mar 7, 2014Updated 12 years ago
- 适用于微信小游戏的Three.js版本☆25Jan 5, 2018Updated 8 years ago
- MegEngine build with cu11x☆17Mar 13, 2023Updated 3 years ago
- Course project of CG(CS100433, 2019 fall, Tongji Univ.): A real-time billiard simulator using both ray tracing and graphics pipeline.☆12Sep 19, 2020Updated 5 years ago
- A Triton JIT runtime and ffi provider in C++☆36May 27, 2026Updated last month
- Stanford CS144 Solutions Fall2020☆14Aug 30, 2022Updated 3 years ago
- My solution code to parallel architecture and programming Spring 2016☆12Aug 15, 2016Updated 9 years ago
- LLM Inference via Triton (Flexible & Modular): Focused on Kernel Optimization using CUBIN binaries, Starting from gpt-oss Model☆118Apr 28, 2026Updated 2 months ago
- Final Project for Parallel Computing at CMU (15-618/15-418)☆10May 13, 2016Updated 10 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆11May 6, 2019Updated 7 years ago
- wiki for the Hyades cluster at UCSC☆36May 11, 2020Updated 6 years ago
- ☆14Dec 12, 2018Updated 7 years ago
- Reading seminar in Harvard Cloud Networking and Systems Group☆16Aug 29, 2022Updated 3 years ago
- rapids团队 (https://github.com/WANG-lp and https://github.com/CheYulin ),香港科技大学,2017年,第三届阿里中间件性能挑战赛初赛代码(第32名)☆14Jul 7, 2017Updated 8 years ago
- It is implementation of Research paper "DEEP GRADIENT COMPRESSION: REDUCING THE COMMUNICATION BANDWIDTH FOR DISTRIBUTED TRAINING". Deep g…☆18Aug 14, 2019Updated 6 years ago
- PyTorch implementation of "Online Hyperparameter Optimization for Class-Incremental Learning" (AAAI 2023 Oral)☆17Jun 30, 2023Updated 2 years ago