Hsin-Hung / N-body-simulation
A real-time N-body simulation with the Barnes-Hut algorithm and CUDA
☆9Updated 9 months ago
Related projects
Alternatives and complementary repositories for N-body-simulation
- Uses Tensor Cores to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instructions.☆11Updated last year
- A high-performance, tiny TVM graph executor library written in C that can compile to WebAssembly and use CUDA/WebGPU as the accelerat…☆9Updated last year
- Component for lazy image loading, written in Vue.js.☆8Updated 3 years ago
- The project leverages Apache Flink, Apache Kafka and Python digital Twin to provide real-time insights into healthcare data, enabling tim…☆10Updated last year
- QSimPy: A Learning-centric Simulation Framework for Quantum Cloud Resource Management☆9Updated 2 months ago
- A collection of awesome algorithms, implemented in CUDA.☆24Updated 6 years ago
- ROC_SHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆39Updated last year
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Updated 3 years ago
- Performance Prediction Toolkit☆51Updated 3 years ago
- Intel® SHMEM - Device initiated shared memory based communication library☆21Updated 2 weeks ago
- GPU Performance Advisor☆63Updated 2 years ago
- Resilient Virtual Machine Monitor is a complete fault tolerance solution for type-I hypervisors adopting one of the most popular VMM arch…☆10Updated 4 years ago
- OpenDNN: An Open-source, cuDNN-like Deep Learning Primitive Library☆19Updated 4 years ago
- Web-Based Video Editor☆11Updated last year
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆14Updated 5 years ago
- A benchmark suite for Graph Machine Learning☆17Updated last month
- Analyze data any time, anywhere☆10Updated last year
- A simple profiler to count NVIDIA PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆43Updated 10 months ago
- StarPU Runtime system☆16Updated 14 years ago
- GEMM and Winograd based convolutions using CUTLASS☆25Updated 4 years ago
- This is the source code of the 2021 replication for ReScience of the paper "Speedup Graph Processing by Graph Ordering" by Hao Wei, Jeffr…☆10Updated 3 years ago
- Reference implementation of Deep Neural Network primitives using LIBXSMM's Tensor Processing Primitives (TPP)☆12Updated 3 months ago
- MLIR-based toolkit targeting Intel heterogeneous hardware☆32Updated this week
- Guides and examples to help achieve optimal performance on an NVIDIA Grace CPU☆12Updated 3 months ago
- Regex Engine using SIMD and Roaring-Bitmaps☆8Updated last year
- A web interface for the SuiteSparse Matrix Collection, formerly known as the University of Florida Sparse Matrix Collection☆22Updated 3 weeks ago
- This package includes the implementation for Sparse-Matrix-Vector-Multiplication (SpMV) and Sparse-Matrix-Matrix-Multiplication (SpMM) fo…☆10Updated 4 years ago