☆54Dec 4, 2023Updated 2 years ago
Alternatives and similar repositories for ECE408
Users that are interested in ECE408 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code base and slides for ECE408:Applied Parallel Programming On GPU.☆148Jul 2, 2021Updated 5 years ago
- Applied Parallel Programming UIUC FA 2017☆31Jan 14, 2018Updated 8 years ago
- Xiao's CUDA Optimization Guide [NO LONGER ADDING NEW CONTENT]☆327Nov 8, 2022Updated 3 years ago
- CUDA solutions for the lab assignments in the UIUC-ECE408 Applied Parallel Programming course.☆21Apr 18, 2023Updated 3 years ago
- ☆24Oct 31, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Repository for AI model benchmarking on TT-Buda☆16Feb 9, 2026Updated 4 months ago
- Lab 5 project of MIT-6.5940, deploying LLaMA2-7B-chat on one's laptop with TinyChatEngine.☆18Dec 1, 2023Updated 2 years ago
- CORE-ReID: Comprehensive Optimization and Refinement through Ensemble fusion in Domain Adaptation for person re-identification☆16May 7, 2025Updated last year
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆79Jan 21, 2021Updated 5 years ago
- SystemVerilog implemention of the TAGE branch predictor☆14May 26, 2021Updated 5 years ago
- 斯坦福2022春季编译原理实验☆24Mar 8, 2023Updated 3 years ago
- Triton Compiler related materials.☆45Mar 16, 2026Updated 3 months ago
- [HPCA 2026] A GPU-optimized system for efficient long-context LLMs decoding with low-bit KV cache.☆96May 14, 2026Updated last month
- a simple API to use CUPTI☆10Aug 19, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ECE408 (Applied Parallel Programming) Fall 2022 MP☆21Mar 24, 2023Updated 3 years ago
- Trajectory planning for highway situation with classic robotics approach.☆12May 23, 2018Updated 8 years ago
- flash attention tutorial written in python, triton, cuda, cutlass☆525Jan 20, 2026Updated 5 months ago
- 高性能计算相关知识学习笔记,包含学习笔记和相关知识的代码demo,在持续完善中。 如果有帮助的话请Star一下,对作者帮助很大,谢谢!☆474Mar 28, 2023Updated 3 years ago
- ☆184May 11, 2026Updated last month
- Very simple and stupid TCP/IP stack written in C☆10Mar 25, 2016Updated 10 years ago
- A Light CNN Framework!☆16Apr 8, 2019Updated 7 years ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Jun 22, 2026Updated last week
- My solutions for the data challenges of Accelerated Computer Science Fundamentals Certification on Coursera☆11Oct 10, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 《C++23 Best Practices》的非专业个人翻译☆30Nov 27, 2025Updated 7 months ago
- Tenstorrent Firmware repository☆24Feb 25, 2026Updated 4 months ago
- Tutorial for assignment of Introduction to Database System☆12Sep 29, 2025Updated 9 months ago
- ☆16Oct 21, 2023Updated 2 years ago
- My second attempt at a RISC-V CPU with learnings form my previous attempt.☆11Apr 27, 2026Updated 2 months ago
- A PyG-based package of spectral GNNs with benchmark evaluations (SIGMOD 2026).☆19Aug 20, 2025Updated 10 months ago
- tensorflow fork with Salus integration☆12Jan 7, 2022Updated 4 years ago
- Simple Robin Hood hash table implemented using C macros☆15Feb 7, 2025Updated last year
- Simple webapp (Google Appengine) to convert UIUC course listings into importable calendar files☆24Aug 8, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Fast and easy distributed model training examples.☆12Nov 26, 2024Updated last year
- Herald: Accelerating Neural Recommendation Training with Embedding Scheduling (NSDI 2024)☆23May 9, 2024Updated 2 years ago
- My solution code to parallel architecture and programming Spring 2016☆12Aug 15, 2016Updated 9 years ago
- Inference Llama 2 in one file of pure Cuda☆17Aug 20, 2023Updated 2 years ago
- ☆121Apr 11, 2024Updated 2 years ago
- A PyTorch-like deep learning framework. Just for fun.☆157Oct 9, 2023Updated 2 years ago
- Rust bindings for SPDK☆12Mar 5, 2020Updated 6 years ago