my cs notes
☆61Oct 14, 2024Updated last year
Alternatives and similar repositories for cs-notes
Users that are interested in cs-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A classic 5-stage rv32i(incomplete) toy implementation based on powerful SpinalHDL☆10Jul 5, 2021Updated 4 years ago
- ☆42Mar 4, 2026Updated 2 weeks ago
- ☆20Jun 4, 2021Updated 4 years ago
- 收录SC小组在学习高性能计算、分布式架构、数据挖掘与人工智能方向的笔记和材料☆15Oct 29, 2021Updated 4 years ago
- 🎉CUDA 笔记 / 高频面试题汇总 / C++笔记,个人笔记,更新随缘: sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.☆40Jan 25, 2024Updated 2 years ago
- A light llama-like llm inference framework based on the triton kernel.☆176Jan 5, 2026Updated 2 months ago
- compile yolov3 in TVM☆13Aug 14, 2023Updated 2 years ago
- My study note for mlsys☆14Nov 4, 2024Updated last year
- Using TVM to depoly Transformer on CPU and GPU☆11Aug 25, 2021Updated 4 years ago
- Homework of CMU 10-414/714: Deep Learning Systems (https://dlsyscourse.org/)☆15Mar 21, 2024Updated 2 years ago
- An implementation for Sugyama's algorithm for displaying a layered graph.☆24Sep 21, 2025Updated 6 months ago
- ☆11Jun 24, 2015Updated 10 years ago
- 校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。☆512Oct 28, 2025Updated 4 months ago
- ☆13Apr 12, 2023Updated 2 years ago
- ☆68Mar 4, 2023Updated 3 years ago
- A PyG-based package of spectral GNNs with benchmark evaluations (SIGMOD 2026).☆19Aug 20, 2025Updated 7 months ago
- An agent for CUDA compute-communication kernel co-design☆32Mar 11, 2026Updated 2 weeks ago
- ☆16Mar 26, 2020Updated 5 years ago
- ☆13Dec 29, 2020Updated 5 years ago
- 使用 CUDA C++ 实现的 llama 模型推理框架☆63Nov 8, 2024Updated last year
- Dockerfile for RL research. Including MuJoCo / DMC / PyTorch / Tensoflow / Atari support.☆16Jan 5, 2022Updated 4 years ago
- PyTorch -> ONNX -> TVM for autotuning☆24Feb 28, 2020Updated 6 years ago
- Memory footprint reduction for transformer models☆11Jan 24, 2023Updated 3 years ago
- Herald: Accelerating Neural Recommendation Training with Embedding Scheduling (NSDI 2024)☆23May 9, 2024Updated last year
- Nicol is an open-source web service, developed using the Kotlin programming language, that enables streaming Server Stream Events and s…☆11Dec 10, 2023Updated 2 years ago
- Course Projects for Stanford CS142 Web Applications☆10Oct 15, 2016Updated 9 years ago
- 一个LLMs接口的学习示例☆19Apr 22, 2025Updated 11 months ago
- Code release for book "Efficient Training in PyTorch"☆126Apr 10, 2025Updated 11 months ago
- 基于RabbitMQ和本地消息表实现的分布式事务框架☆14Aug 23, 2023Updated 2 years ago
- A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters☆57Jul 23, 2024Updated last year
- [ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks☆11May 21, 2023Updated 2 years ago
- Code for the SIGIR20 paper -- Measuring and Mitigating Item Under-Recommendation Bias inPersonalized Ranking Systems☆16Apr 28, 2020Updated 5 years ago
- 基于 AOP、Spring 动态数据源切换、MyBatis 插件开发、散列算法等技术,实现的 SpringBoot Starter 数据库路由组件,该组件在分库分表场景下,支持个性化的分库分表、只分库或只分表,甚至双字段控制分库分表。它的横向扩展性和易维护性为系统的持续发展…☆18Jul 6, 2023Updated 2 years ago
- ASIC Verification at 2022 Spring. This course only use SystemVerilog, did not use UVM.☆19Feb 14, 2023Updated 3 years ago
- Scalable radix top-k selection on GPUs.☆21Jan 27, 2025Updated last year
- This is the official impletations of the EMNLP Findings paper, VideoINSTA: Zero-shot Long-Form Video Understanding via Informative Spatia…☆25Nov 15, 2024Updated last year
- SystemC Common Practices (SCP)☆35Feb 27, 2026Updated 3 weeks ago
- Classification of audio signals using PyTorch☆13May 19, 2020Updated 5 years ago
- ☆12Jul 8, 2022Updated 3 years ago