C++ training, season 1
☆57Aug 2, 2022Updated 3 years ago
Alternatives and similar repositories for cpp-training-season1
Users that are interested in cpp-training-season1 are comparing it to the libraries listed below
Sorting:
- ☆11Apr 5, 2021Updated 4 years ago
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- OneFlow Serving☆21Apr 10, 2025Updated 10 months ago
- a simple API to use CUPTI☆11Aug 19, 2025Updated 6 months ago
- ☆12Sep 1, 2023Updated 2 years ago
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- 小彭老师推出 SyCL 2020 课程(施工中,日后会在直播中放出)☆15Sep 3, 2023Updated 2 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large …☆65Mar 21, 2022Updated 3 years ago
- Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference.☆46Jun 11, 2025Updated 8 months ago
- Python package for rematerialization-aware gradient checkpointing☆27Oct 31, 2023Updated 2 years ago
- Here is a final lab of Compiler in USTC, focusing on MLIR☆20Jan 29, 2021Updated 5 years ago
- Sample project for a small, flexible runtime reflection system using C++11☆309Aug 29, 2020Updated 5 years ago
- ☆29Oct 3, 2022Updated 3 years ago
- https://start.oneflow.org/oneflow-yolo-doc☆23Mar 14, 2023Updated 2 years ago
- ☆25Jun 24, 2021Updated 4 years ago
- Nex Venus Communication Library☆72Nov 17, 2025Updated 3 months ago
- ☆27May 27, 2024Updated last year
- Asynchronous pipeline parallel optimization☆19Feb 2, 2026Updated last month
- Transformers components but in Triton☆34May 9, 2025Updated 9 months ago
- A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction☆32Feb 1, 2023Updated 3 years ago
- Prefix-Aware Attention for LLM Decoding☆29Jan 23, 2026Updated last month
- ☆30Aug 31, 2022Updated 3 years ago
- [c++]使用boost.asio写的简单内存键值对缓存☆11Jul 31, 2017Updated 8 years ago
- Kunlun-storage is the storage component for Kunlun distributed DBMS. It's developed based on percona-mysql-8.0.x and contains exclusive f…☆32Apr 11, 2022Updated 3 years ago
- ☆38Oct 12, 2024Updated last year
- 《现代OpenGL保姆级课程》的课件专用仓库☆81May 26, 2024Updated last year
- ☆77Dec 18, 2022Updated 3 years ago
- Document the demo and a series of documents for learning the diffusion model.☆42Jun 29, 2023Updated 2 years ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆163Feb 11, 2026Updated 2 weeks ago
- Datasets, Transforms and Models specific to Computer Vision☆91Nov 17, 2023Updated 2 years ago
- 分布式kv存储☆10Sep 8, 2020Updated 5 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- A simple OpenGL 3.2 example using MSVS 2010 and freeglut☆12Feb 4, 2013Updated 13 years ago
- ☆10Aug 15, 2022Updated 3 years ago
- 建筑语义分割标签转实例分割标签☆11Sep 8, 2021Updated 4 years ago
- A distributed stream querying engine that provides sub-millisecond stateful query at millions of queries per-second over fast-evolving li…☆10Jul 18, 2018Updated 7 years ago
- 实现一个子集c编译器,后端基于llvm20☆12Mar 13, 2025Updated 11 months ago
- 3D A* pathfinding for UAVs with no-fly zone avoidance and real-time Cesium visualization | 无人机三维A*寻路算法,支持禁飞区规避与实时Cesium可视化☆32Oct 21, 2025Updated 4 months ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago