AnthonyYsw / CS106B_2021_raw
All Resources from Stanford CS106B 2021
☆23Jul 11, 2025Updated 7 months ago
Alternatives and similar repositories for CS106B_2021_raw
Users that are interested in CS106B_2021_raw are comparing it to the libraries listed below
- Notes for the Shanghai Jiao Tong University School of Software course "Architecture of Application Systems" (SE3353)☆12Feb 2, 2024Updated 2 years ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- ShanghaiTech CS101 Algorithm and Data Structures, Fall 2022, Fall 2024.☆12Sep 30, 2025Updated 4 months ago
- ☆11Sep 21, 2022Updated 3 years ago
- GEMM☆10Aug 26, 2023Updated 2 years ago
- Software Engineering and Computing II☆11Dec 29, 2020Updated 5 years ago
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆29Updated this week
- NJUCS Fall 2021 course project for "Advanced Programming"☆10Jun 28, 2025Updated 7 months ago
- GEMV implementation with CUTLASS☆19Aug 21, 2025Updated 5 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆17Updated this week
- ☆17Nov 22, 2025Updated 2 months ago
- 🎓Automatically updates circult-eda-mlsys-tinyml papers daily using GitHub Actions (updates every 8 hours)☆10Updated this week
- Computer programming - ShanghaiTech☆12Jan 10, 2020Updated 6 years ago
- ☆12Aug 31, 2023Updated 2 years ago
- Cute layout visualization☆30Jan 18, 2026Updated 3 weeks ago
- ☆14Nov 3, 2025Updated 3 months ago
- ☆13Jan 15, 2022Updated 4 years ago
- PoPS algorithm☆15Dec 8, 2022Updated 3 years ago
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 5 months ago
- Fast GPU based tensor core reductions☆13Jan 13, 2023Updated 3 years ago
- SJTU-SE Advanced Data Structures☆14Jun 8, 2023Updated 2 years ago
- Problem Sets for Discrete Mathematics @ software.nju.edu.cn☆12Jun 24, 2021Updated 4 years ago
- a reactor network library☆16Aug 21, 2025Updated 5 months ago
- Companion code for 《汇编语言一发入魂》 (an assembly language tutorial)☆15May 30, 2020Updated 5 years ago
- Welcome to the GPU-FFT-Optimization repository! We present cutting-edge algorithms and implementations for optimizing the Fast Fourier Tr…☆21Dec 19, 2025Updated last month
- portFFT is a library implementing Fast Fourier Transforms using SYCL☆19Mar 1, 2025Updated 11 months ago
- Solution code for the after-class exercises of Programming Thinking and Methods (CS1501), Shanghai Jiao Tong University, fall semester 2021-2022☆15Dec 24, 2021Updated 4 years ago
- ☆15Mar 23, 2022Updated 3 years ago
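- This repository deploys the NanoDet detection algorithm under the OpenVINO inference framework, with rewritten pre-processing and post-processing for very high performance, so detection on Intel CPU platforms takes off! The model is also quantized (PTQ) to int8 precision with the NNCF and PPQ tools for even faster inference!☆16Jun 14, 2023Updated 2 years ago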
- PyTorch implementations of FinGAN and TimeGAN to generate financial time series☆20Nov 13, 2024Updated last year
- ☆32Jul 2, 2025Updated 7 months ago
- To better understand the ggml library☆27Jun 13, 2025Updated 8 months ago
- Implementation and optimization of matrix multiplication on single CPU (HPC-THU-2023-Autumn)☆18Feb 27, 2024Updated last year
- Autograd in Rust, with video explanation.☆24Jun 19, 2025Updated 7 months ago
- ☆19Sep 23, 2020Updated 5 years ago
- Structure and Interpretation of Computer Programs (SICP) , Fall 2021, Nanjing University☆17Dec 30, 2024Updated last year
- My submission for the GPUMODE/AMD fp8 mm challenge☆29Jun 4, 2025Updated 8 months ago
- ToyLLM: Learning LLM from Scratch☆25Updated this week
- Instead of scrolling on the phone after work in the evening, learn something. Series 1: the CUDA compute framework CUFX (Cuda Framework eXtended).☆16Dec 15, 2024Updated last year