☆53Dec 4, 2023Updated 2 years ago
Alternatives and similar repositories for ECE408
Users that are interested in ECE408 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code base and slides for ECE408:Applied Parallel Programming On GPU.☆147Jul 2, 2021Updated 4 years ago
- IMPACT GPU Algorithms Teaching Labs☆59Apr 21, 2023Updated 3 years ago
- The RAI client allows one to interact with a cluster of machine to submit and evaluate code. RAI is a scalable job submission system desi…☆39Jun 30, 2019Updated 6 years ago
- Applied Parallel Programming UIUC FA 2017☆31Jan 14, 2018Updated 8 years ago
- Xiao's CUDA Optimization Guide [NO LONGER ADDING NEW CONTENT]☆326Nov 8, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆136May 19, 2020Updated 6 years ago
- ☆24Oct 31, 2023Updated 2 years ago
- Repository for AI model benchmarking on TT-Buda☆16Feb 9, 2026Updated 3 months ago
- 《2021医学健康数据分析与挖掘》课程论文 -- 基于BERT的20NewsGroups数据集新闻分类实验☆10Jun 22, 2021Updated 4 years ago
- Lab 5 project of MIT-6.5940, deploying LLaMA2-7B-chat on one's laptop with TinyChatEngine.☆18Dec 1, 2023Updated 2 years ago
- Eliminate compaction jobs in secondary nodes within a group of replicated RocksDB.☆10Jun 5, 2024Updated last year
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆79Jan 21, 2021Updated 5 years ago
- Triton Compiler related materials.☆44Mar 16, 2026Updated 2 months ago
- 支持RTMDet、YOLOv8、YOLOX、Faster R-CNN等常见算法的ncnn部署☆13Mar 17, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- a simple API to use CUPTI☆10Aug 19, 2025Updated 9 months ago
- ECE408 (Applied Parallel Programming) Fall 2022 MP☆20Mar 24, 2023Updated 3 years ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆55Jul 3, 2022Updated 3 years ago
- flash attention tutorial written in python, triton, cuda, cutlass☆516Jan 20, 2026Updated 4 months ago
- 高性能计算相关知识学习笔记,包含学习笔记和相关知识的代码demo,在持续完善中。 如果有帮助的话请Star一下,对作者帮助很大,谢谢!☆475Mar 28, 2023Updated 3 years ago
- ☆179May 11, 2026Updated last week
- Very simple and stupid TCP/IP stack written in C☆10Mar 25, 2016Updated 10 years ago
- Fireboy & Water Girl in the Forest Temple implemented on an FPGA board for UIUC's ECE385 Digital Systems Laboratory.☆22Mar 30, 2023Updated 3 years ago
- A Light CNN Framework!☆16Apr 8, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- measures and panorama ortorectification on google streetview☆20Aug 16, 2021Updated 4 years ago
- In-depth tutorials and examples on LLM training and inference infrastructure, such as, Pytorch, Fairscale, Nvidia AI Modules (cuDNN, tens…☆22May 19, 2025Updated last year
- 《C++23 Best Practices》的非专业个人翻译☆29Nov 27, 2025Updated 5 months ago
- Tenstorrent Firmware repository☆24Feb 25, 2026Updated 2 months ago
- 大满贯的场内作文合集☆18Oct 30, 2022Updated 3 years ago
- Tutorial for assignment of Introduction to Database System☆12Sep 29, 2025Updated 7 months ago
- My second attempt at a RISC-V CPU with learnings form my previous attempt.☆10Apr 27, 2026Updated 3 weeks ago
- A PyG-based package of spectral GNNs with benchmark evaluations (SIGMOD 2026).☆18Aug 20, 2025Updated 9 months ago
- ☆22Nov 9, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- alibabacloud-aiacc-demo☆43May 4, 2023Updated 3 years ago
- Simple Robin Hood hash table implemented using C macros☆15Feb 7, 2025Updated last year
- ☆18Aug 18, 2022Updated 3 years ago
- Fast and easy distributed model training examples.☆12Nov 26, 2024Updated last year
- 給新手的C++教學 at code風景區☆10Apr 21, 2017Updated 9 years ago
- Herald: Accelerating Neural Recommendation Training with Embedding Scheduling (NSDI 2024)☆23May 9, 2024Updated 2 years ago
- My solution code to parallel architecture and programming Spring 2016☆12Aug 15, 2016Updated 9 years ago