Practical exercises for HOW Series "Deep Dive", a Web-based training on parallel programming and performance optimization
☆33Feb 1, 2019Updated 7 years ago
Alternatives and similar repositories for HOW-Series-Labs
Users that are interested in HOW-Series-Labs are comparing it to the libraries listed below
Sorting:
- ☆18Jul 26, 2024Updated last year
- Intel specific developments☆22Apr 8, 2021Updated 4 years ago
- Pure Triton kernels for Qwen3.5-27B inference on NVIDIA B200☆66Updated this week
- ☆44Updated this week
- A powerful Laravel storage driver that enables seamless synchronization of files across multiple disks, with an integrated cache disk for…☆15Nov 11, 2025Updated 3 months ago
- ☆111Updated this week
- Multi-GPU communication profiler and visualizer☆38Jun 10, 2024Updated last year
- A collection of reproducible inference engine benchmarks☆38Apr 22, 2025Updated 10 months ago
- ☆28Dec 3, 2025Updated 3 months ago
- A batched implementation for efficient Qwen2.5-VL inference.☆22Jul 16, 2025Updated 7 months ago
- A SystemVerilog-based simulation and design of a Last Level Cache (LLC) implementing the MESI protocol, featuring Pseudo-LRU replacement,…☆15Nov 24, 2025Updated 3 months ago
- 详细双语注释版word2vec源码,well-annotated word2vec☆10Oct 3, 2021Updated 4 years ago
- The SEAL-CPU backend is a Reference backend engine for HEBench which is a shared library that implements the required functions specified…☆11Mar 3, 2023Updated 3 years ago
- DuraCloud open source project☆18Dec 3, 2025Updated 3 months ago
- Copy millions of objects in minutes☆12Oct 21, 2019Updated 6 years ago
- The meat and potatoes behind farosctl☆13Feb 28, 2023Updated 3 years ago
- ☆11May 14, 2022Updated 3 years ago
- SKFAC Preconditioner for MindSpore☆12Jul 2, 2021Updated 4 years ago
- Sparse Matrix Factorization (SMF) is a key component in many machine learning problems and there exist a verity a applications in real-w…☆11Jan 25, 2016Updated 10 years ago
- A curated collection of practical Laravel tips, tricks, and best practices to help you write cleaner, faster, and more efficient code. Wh…☆10Apr 13, 2025Updated 10 months ago
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming☆177Feb 27, 2026Updated last week
- optimize TCP settings and download speeds of applications on Windows systems for improved network performance☆11Apr 29, 2025Updated 10 months ago
- pip install patchelf. patchelf Python wheel for PyPI.☆11Updated this week
- ☆14Feb 11, 2026Updated 3 weeks ago
- Enterprise AWS 3-tier infrastructure blueprint with zero-cost validation strategy • 57 resources across 6 Terraform modules • Safety-firs…☆25Nov 5, 2025Updated 4 months ago
- White paper/journal paper on best practices developing sustainable scientific software☆10Jul 4, 2017Updated 8 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- Portfolio and blog for my online brand. Performance Optimized Single Page React App.☆11Nov 7, 2016Updated 9 years ago
- My personal dotfiles with automated macOS setup. Features smart installation scripts, Bats testing (bash), performance monitoring, and 2…☆11Feb 6, 2026Updated 3 weeks ago
- Just simple JavaScript framework. Provides support for manipulating with DOM and events handling. Easy for use, optimized for performance…☆11Feb 15, 2017Updated 9 years ago
- Core code for Profiles RNS☆19Dec 18, 2025Updated 2 months ago
- CPU and GPU tutorial examples☆13Apr 4, 2025Updated 11 months ago
- ☆10Nov 22, 2022Updated 3 years ago
- ☆15Apr 6, 2016Updated 9 years ago
- A conda-smithy repository for ctng-compiler-activation.☆14Feb 12, 2026Updated 3 weeks ago
- Code for "What really matters in matrix-whitening optimizers?"☆22Oct 31, 2025Updated 4 months ago
- The proposal of this work involves a simulation of an ant colony swarm that was applied to a problem of search and rescue of objects of i…☆12Aug 5, 2023Updated 2 years ago
- ☆13Jan 7, 2025Updated last year
- Speeding Up Your Python Codes 1000x☆12Apr 2, 2025Updated 11 months ago