thesis-nozal / PhD
"Optimizing Performance and Energy Efficiency in Massively Parallel Systems" PhD Dissertation repository.
☆29Updated 3 years ago
Alternatives and similar repositories for PhD
Users who are interested in PhD are comparing it to the repositories listed below
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆14Updated 4 years ago
- DPLASMA is a highly optimized, accelerator-aware, implementation of a dense linear algebra package for distributed heterogeneous systems…☆16Updated 8 months ago
- Optimizing loading training data from cloud bucket storage for cloud-based distributed deep learning. Official repository for Quantifying…☆11Updated 4 years ago
- Sparse Matrix Factorization (SMF) is a key component in many machine learning problems and there exist a variety of applications in real-w…☆11Updated 10 years ago
- AI-Agency-Website is a modern, responsive website for an AI-driven agency, featuring sleek design, dynamic content, and optimized perform…☆15Updated 10 months ago
- High-performance CUDA kernels for real-time financial low latency inference, optimized for both consumer and datacenter GPUs.☆20Updated 6 months ago
- Usability and Performance in Heterogeneous Computing. Official EngineCL repository. Peer-reviewed (FGCS).☆21Updated 5 years ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 3 years ago
- A benchmark suite for performance-oriented shell-optimization research☆29Updated 2 months ago
- The SparseX sparse kernel optimization library☆43Updated 7 years ago
- Barcelona OpenMP Task Suite is a collection of applications that allow testing OpenMP tasking implementations and comparing their behaviour u…☆46Updated 6 years ago
- Principles and Methodologies for Serial Performance Optimization (OSDI '25)☆23Updated 7 months ago
- Thoughts on programming languages, compilers, optimization, and performance.☆10Updated 6 years ago
- COBAYN: Compiler Autotuning Framework Using Bayesian Networks☆20Updated 3 years ago
- An easy-to-use automatic performance diagnosis and optimization tool for HPC applications☆35Updated 8 years ago
- Little OpenMP Library☆170Updated 3 years ago
- Teaching Vectorization and SIMD using Intel Intrinsics in a Computer Organization and Architecture class☆17Updated 11 months ago
- A Task-based Library for Solving Dense Nonsymmetric Eigenvalue Problems☆23Updated 3 years ago
- NAS Parallel Benchmark Kernels in C/C++. The parallel versions are in FastFlow, TBB, and OpenMP.☆22Updated 4 years ago
- Global Memory and Threading runtime system☆24Updated last month
- OpenMP Offloading Validation & Verification Suite; Official repository. We have migrated from bitbucket!! For documentation, results, pub…☆59Updated last week
- TTG: Template Task Graph C++ API☆26Updated 2 months ago
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆76Updated 3 months ago
- A Method for efficiently processing SpMV using SIMD and load balancing☆17Updated 3 years ago
- MPI Tutorial Exercises☆46Updated 12 years ago
- Information about many aspects of high-performance computing. Wiki content moved to ~/docs.☆312Updated last month
- StarPU Runtime system☆16Updated 15 years ago
- DiscoPoP - Discovery of Potential Parallelism☆52Updated 2 weeks ago
- KaGen: Communication-free Massively Distributed Graph Generators☆41Updated last week
- Practical exercises for HOW Series "Deep Dive", a Web-based training on parallel programming and performance optimization☆33Updated 7 years ago