☆50Dec 4, 2023Updated 2 years ago
Alternatives and similar repositories for ECE408
Users that are interested in ECE408 are comparing it to the libraries listed below
Sorting:
- A naive implementation of a chess engine☆12Nov 29, 2025Updated 3 months ago
- Code base and slides for ECE408:Applied Parallel Programming On GPU.☆142Jul 2, 2021Updated 4 years ago
- IMPACT GPU Algorithms Teaching Labs☆59Apr 21, 2023Updated 2 years ago
- Xiao's CUDA Optimization Guide [NO LONGER ADDING NEW CONTENT]☆323Nov 8, 2022Updated 3 years ago
- CUDA solutions for the lab assignments in the UIUC-ECE408 Applied Parallel Programming course.☆19Apr 18, 2023Updated 2 years ago
- Instructions, Docker images, and examples for Nsight Compute and Nsight Systems☆137May 19, 2020Updated 5 years ago
- ☆23Oct 31, 2023Updated 2 years ago
- 《2021医学健康数据分析与挖掘》课程论文 -- 基于BERT的20NewsGroups数据集新闻分类实验☆10Jun 22, 2021Updated 4 years ago
- Lab 5 project of MIT-6.5940, deploying LLaMA2-7B-chat on one's laptop with TinyChatEngine.☆18Dec 1, 2023Updated 2 years ago
- Eliminate compaction jobs in secondary nodes within a group of replicated RocksDB.☆10Jun 5, 2024Updated last year
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆77Jan 21, 2021Updated 5 years ago
- SystemVerilog implemention of the TAGE branch predictor☆14May 26, 2021Updated 4 years ago
- Triton Compiler related materials.☆42Updated this week
- [HPCA 2026] A GPU-optimized system for efficient long-context LLMs decoding with low-bit KV cache.☆81Dec 18, 2025Updated 3 months ago
- 斯坦福2022春季编译原理实验☆24Mar 8, 2023Updated 3 years ago
- 支持RTMDet、YOLOv8、YOLOX、Faster R-CNN等常见算法的ncnn部署☆13Mar 17, 2024Updated 2 years ago
- ECE408 (Applied Parallel Programming) Fall 2022 MP☆20Mar 24, 2023Updated 2 years ago
- a simple API to use CUPTI☆10Aug 19, 2025Updated 7 months ago
- Implementation of the Modbus protocol in .NET; containing ASCII, RTU and TCP.☆10Jan 12, 2026Updated 2 months ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆55Jul 3, 2022Updated 3 years ago
- Code from the CMU LM inference fall 2025 edition.☆34Dec 7, 2025Updated 3 months ago
- flash attention tutorial written in python, triton, cuda, cutlass☆491Jan 20, 2026Updated 2 months ago
- 高性能计算相关知识学习笔记,包含学习笔记和相关知识的代码demo,在持续完善中。 如果有帮助的话请Star一下,对作者帮助很大,谢谢!☆471Mar 28, 2023Updated 2 years ago
- Student starter code for Fall 2019 labs☆13Nov 28, 2019Updated 6 years ago
- ☆169Feb 5, 2026Updated last month
- 这是我生产实习的项目——GAN实现图像风格迁移☆13Jul 21, 2021Updated 4 years ago
- A Light CNN Framework!☆16Apr 8, 2019Updated 6 years ago
- 粤港澳大湾区金融数学建模(量化模型的建立)☆19Feb 14, 2021Updated 5 years ago
- measures and panorama ortorectification on google streetview☆20Aug 16, 2021Updated 4 years ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Updated this week
- In-depth tutorials and examples on LLM training and inference infrastructure, such as, Pytorch, Fairscale, Nvidia AI Modules (cuDNN, tens…☆20May 19, 2025Updated 10 months ago
- 《C++23 Best Practices》的非专业个人翻译☆29Nov 27, 2025Updated 3 months ago
- 大满贯的场内作文合集☆18Oct 30, 2022Updated 3 years ago
- 手搓Llama,个人学习用☆16May 21, 2024Updated last year
- multi-bit language model watermarking (NAACL 24)☆17Sep 20, 2024Updated last year
- My second attempt at a RISC-V CPU with learnings form my previous attempt.☆10Apr 29, 2024Updated last year
- A PyG-based package of spectral GNNs with benchmark evaluations (SIGMOD 2026).☆19Aug 20, 2025Updated 7 months ago
- tensorflow fork with Salus integration☆12Jan 7, 2022Updated 4 years ago
- ☆18Aug 18, 2022Updated 3 years ago