shen-shanshan / cs-self-learningView external linksLinks
This repo is used for archiving my notes, codes and materials of cs learning.
☆79Updated this week
Alternatives and similar repositories for cs-self-learning
Users that are interested in cs-self-learning are comparing it to the libraries listed below
Sorting:
- Implementation from scratch in CUDA C++ of image processing algorithms.☆21Oct 26, 2020Updated 5 years ago
- Optimize softmax in triton in many cases☆22Sep 6, 2024Updated last year
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆101Aug 25, 2025Updated 5 months ago
- Triton Compiler related materials.☆43Jan 4, 2025Updated last year
- ☆45May 4, 2025Updated 9 months ago
- C++ and CUDA extensions for Python/Pytorch and GPU Accelerated Augmentation.☆35Nov 30, 2022Updated 3 years ago
- Penn CIS 5650 (GPU Programming and Architecture) Final Project☆44Dec 11, 2023Updated 2 years ago
- ☆49Apr 15, 2024Updated last year
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 2 months ago
- Workshop materials for AI Engineer World's Fair☆13Jun 3, 2025Updated 8 months ago
- ☆54Mar 15, 2025Updated 10 months ago
- 跟着Tensorrt_pro学习各种知识☆40Nov 25, 2022Updated 3 years ago
- This repository is the summary of all of our works for the XLA.☆11Jan 14, 2018Updated 8 years ago
- ☆14Feb 5, 2026Updated last week
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆29Updated this week
- Bee is a tool for helping develop with beego app framework.☆12Nov 14, 2018Updated 7 years ago
- An Tensorflow.keras implementation of Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorizatio…☆10Dec 18, 2019Updated 6 years ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- A toolkit for developers to simplify the transformation of nn.Module instances. It's now corresponding to Pytorch.fx.☆13Apr 7, 2023Updated 2 years ago
- ☆11Sep 21, 2022Updated 3 years ago
- Inference deployment of the llama3☆11Apr 21, 2024Updated last year
- supper pubsub framework based on asynchronous models. provide Topic filter, Fault isolation etc.☆11Jun 3, 2017Updated 8 years ago
- LaTex template for ITMO style presentations☆10Jan 19, 2025Updated last year
- PyTorch implementation of GRPO.☆14Apr 21, 2025Updated 9 months ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 6 months ago
- cpp rotation album,基于cpp eigen实现的3d旋转相册,GAMES101复现内容☆12Jul 25, 2022Updated 3 years ago
- hyperscan using dpdk☆12Jul 15, 2018Updated 7 years ago
- OpenFlow protocol endpoint written in C++☆10Jun 12, 2025Updated 8 months ago
- Common template for pytorch project. Easy to extent and modify for new project.☆13Dec 13, 2022Updated 3 years ago
- ☆13Feb 4, 2021Updated 5 years ago
- rkllm_talking is a standalone compiled voice communication system based on a large model || rkllm_talking 是一个独立编译的基于大模…☆12Oct 13, 2024Updated last year
- ☆40Oct 24, 2023Updated 2 years ago
- ☆47Mar 27, 2023Updated 2 years ago
- ☆12Aug 31, 2023Updated 2 years ago
- ☆25Oct 11, 2025Updated 4 months ago
- URI Component encoder/decoder☆24Feb 28, 2015Updated 10 years ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Feb 8, 2026Updated last week
- GEMV implementation with CUTLASS☆19Aug 21, 2025Updated 5 months ago
- https://bbuf.github.io/gpu-glossary-zh/☆26Nov 7, 2025Updated 3 months ago