thesis-nozal / PhDLinks
"Optimizing Performance and Energy Efficiency in Massively Parallel Systems" PhD Dissertation repository.
☆29Updated 3 years ago
Alternatives and similar repositories for PhD
Users that are interested in PhD are comparing it to the libraries listed below
Sorting:
- 💻 As a Frontend Development Intern at Shen AI (Aug – Oct 2024), I built the company website using React.js and worked with the design te…☆14Updated 5 months ago
- A batched implementation for efficient Qwen2.5-VL inference.☆19Updated 3 months ago
- Sparse Matrix Factorization (SMF) is a key component in many machine learning problems and there exist a verity a applications in real-w…☆11Updated 9 years ago
- Usability and Performance in Heterogeneous Computing. Official EngineCL repository. Peer-reviewed (FGCS).☆21Updated 5 years ago
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆14Updated 4 years ago
- Optimizing loading training data from cloud bucket storage for cloud-based distributed deep learning. Official repository for Quantifying…☆11Updated 3 years ago
- DPLASMA is a highly optimized, accelerator-aware, implementation of a dense linear algebra package for distributed heterogeneous systems…☆15Updated 5 months ago
- Membrane-based dehumidification is currently being considered as a promising solution for the building application due to its low cost an…☆10Updated 4 years ago
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated 2 years ago
- High-performance CUDA kernels for real-time financial low latency inference, optimized for both consumer and datacenter GPUs.☆17Updated 2 months ago
- Thoughts on programming languages, compilers, optimization, and performance.☆10Updated 6 years ago
- 🧩 Hands-on SIMD Programming with C++☆88Updated last week
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in…☆25Updated this week
- Teaching Vectorization and SIMD using Intel Intrinsics in a Computer Organization and Architecture class☆16Updated 8 months ago
- Little OpenMP Library☆168Updated 3 years ago
- TTG: Template Task Graph C++ API☆26Updated 3 months ago
- This is a compact code for reliability analysis under uncertainty using a Polynomial Regression Machine Learning approach. The code imple…☆14Updated 6 years ago
- The NAS Parallel Benchmarks for evaluating C++ parallel programming frameworks on shared-memory architectures☆60Updated last month
- Practical exercises for HOW Series "Deep Dive", a Web-based training on parallel programming and performance optimization☆33Updated 6 years ago
- A unified framework across multiple programming platforms☆41Updated 4 months ago
- NAS Parallel Benchmarks for evaluating GPU and APIs☆27Updated 3 weeks ago
- NAS Parallel Benchmark Kernels in C/C++. The parallel versions are in FastFlow, TBB, and OpenMP.☆22Updated 4 years ago
- Work in progress: Cross Platform Game Engine☆14Updated last month
- A supercharged std::vector implementation (minus Allocator)☆36Updated 8 years ago
- High-performance C++ library for Fast Directional Chamfer Matching, optimized for template matching on untextured objects.☆13Updated 11 months ago
- Global Memory and Threading runtime system☆25Updated last year
- A fast shared & distributed memory task-based runtime in C++☆28Updated 4 years ago
- Repository for "LAFF-On Programming for High Performance"☆43Updated last year
- Principles and Methodologies for Serial Performance Optimization (OSDI' 25)☆17Updated 4 months ago
- Neuralisp is a modular machine learning framework for Common Lisp, focused on deep learning models. It offers a high-performance tensor l…☆21Updated 2 years ago