jinbooooom / ai-infra-hpcView external linksLinks
hpc 教程,包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等
☆116Feb 4, 2026Updated last week
Alternatives and similar repositories for ai-infra-hpc
Users that are interested in ai-infra-hpc are comparing it to the libraries listed below
Sorting:
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆56Jul 3, 2022Updated 3 years ago
- PHASE(Parallel High-performence Agent-based Simulation Environment)☆10Jun 30, 2020Updated 5 years ago
- An Automated Performance Optimization Framework for P4-Programmable SmartNICs☆28Nov 18, 2023Updated 2 years ago
- autoTVM神经网络推理代码优化搜索演示,基于tvm编译开源模型centerface,并使用autoTVM搜索最优推理代码, 最终部署编译为c++代码,演示平台是cuda,可以是其他平台,例如树莓派,安卓手机,苹果手机.Thi is a demonstration of …☆29May 6, 2021Updated 4 years ago
- FUSION is an open-source project aimed at revolutionizing networking through the simulation of advanced SD-EONs and AI-enhanced networks,…☆13Updated this week
- Artifact code release for paper "Uniform-Cost Multi-Path Routing for Reconfigurable Data Center Networks"☆12Sep 5, 2024Updated last year
- All Resources from Stanford CS106B 2021☆23Jul 11, 2025Updated 7 months ago
- ☆45May 4, 2025Updated 9 months ago
- ☆17May 27, 2025Updated 8 months ago
- Extending BookSim2.0 and HotSpot6.0 for Power, Performance and Thermal evaluation of 3D NoC Architectures☆12Aug 9, 2019Updated 6 years ago
- xDEVS: A cross-platform Discrete EVent System simulator☆14Nov 14, 2025Updated 3 months ago
- rewrite python scipy.signal.lfilter in c code☆11Aug 13, 2019Updated 6 years ago
- 使用 cutlass 实现 flash-attention 精简版,具有教学意义☆56Aug 12, 2024Updated last year
- ☆12Nov 8, 2024Updated last year
- ☆14Oct 11, 2024Updated last year
- muslx32 (musl libc and x32 abi) overlay for Gentoo Linux☆10Apr 21, 2021Updated 4 years ago
- fast_faceswap use dlib and change_style_network(基于dlib和风格迁移网络的快速换脸)☆11Jul 18, 2019Updated 6 years ago
- 一个用于管理多个 Claude API 配置的命令行工具。可以轻松在不同环境或账户的 API 密钥和基础 URL 之间切换。☆23Aug 7, 2025Updated 6 months ago
- NonNegative Matrix Factorization with Low Rank via the Alternating Direction Method of Multipliers☆10Jul 4, 2017Updated 8 years ago
- GEMM☆10Aug 26, 2023Updated 2 years ago
- ☆11Sep 21, 2022Updated 3 years ago
- A collection of scripts for producing and analyzing simulations, for computational materials science.☆11Jun 10, 2015Updated 10 years ago
- 带拼音、字形特征的文本纠错模型☆11Jan 1, 2023Updated 3 years ago
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆29Feb 10, 2026Updated last week
- NS3 simulator for RDMA load balancing☆11Jan 31, 2025Updated last year
- Research paper list for host networking: in a system view☆10Jan 2, 2025Updated last year
- AI Accelerators-SC23-tutorial Repository☆11Nov 12, 2023Updated 2 years ago
- Code for nonconvex graph trend filtering☆10May 6, 2022Updated 3 years ago
- Implementation example of Distributed Tensorflow☆10Jul 22, 2017Updated 8 years ago
- ☆15Sep 24, 2023Updated 2 years ago
- Prediction and control of fracture paths in disordered architected materials using graph neural networks☆12Apr 21, 2023Updated 2 years ago
- Python and C++ library to process both experimental and simulation data of colloidal particles.☆15Sep 2, 2021Updated 4 years ago
- Modified Pytorch Lightning implementation of paper:-https://jcheminf.biomedcentral.com/track/pdf/10.1186/s13321-019-0407-y☆10Dec 22, 2020Updated 5 years ago
- Data relevant to the article "Machine learning determination of atomic dynamics at grain boundaries" https://arxiv.org/abs/1803.01416☆11Oct 2, 2018Updated 7 years ago
- MuSim - The Microservices simulator☆13Feb 2, 2016Updated 10 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…☆11Jul 7, 2022Updated 3 years ago
- Cut-pursuit with preconditioned forward-Douglas-Rachford for regularization of classical functionals by graph total variation☆17Aug 5, 2020Updated 5 years ago
- Multimedia SoC Design with Specialization on Application Acceleration with High-Level-Synthesis [2020 Fall]☆12Jun 15, 2021Updated 4 years ago
- Official code repository for the paper titled "Efficient Molecular Conformer Generation with SO(3) Averaged Flow-Matching and Reflow" (IC…☆13Jan 8, 2026Updated last month