We aim to redefine Data Parallel libraries portabiliy, performance, programability and maintainability, by using C++ standard features, instead of creating new compilers.
☆51Updated this week
Alternatives and similar repositories for FusedKernelLibrary
Users that are interested in FusedKernelLibrary are comparing it to the libraries listed below
Sorting:
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆54Updated this week
- diffusers with search engine☆11Jan 13, 2026Updated last month
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆33Nov 29, 2024Updated last year
- CLIP and SigLIP models optimized with TensorRT with a Transformers-like API☆30Sep 29, 2024Updated last year
- HPC tests using MPI codes & synthetic benchmarks with IB/RoCE comparisions - from StackHPC Ltd.☆21Jul 11, 2022Updated 3 years ago
- ☆31Jul 16, 2025Updated 7 months ago
- Framework to reduce autotune overhead to zero for well known deployments.☆97Sep 19, 2025Updated 5 months ago
- Implementation of popular Distributed Systems algorithms in C++☆26Dec 14, 2024Updated last year
- A bunch of kernels that might make stuff slower 😉☆75Feb 18, 2026Updated last week
- Zero-copy multimodal vector DB with CUDA and CLIP/SigLIP☆65May 6, 2025Updated 9 months ago
- ☆33Jan 6, 2025Updated last year
- The Task-Aware MPI (TAMPI) library extends the functionality of standard MPI libraries by providing new mechanisms for improving the inte…☆25Jun 6, 2025Updated 8 months ago
- ☆10Dec 25, 2022Updated 3 years ago
- Create and deploy virtual-experiments - co-processing computational workflows☆10Jan 28, 2026Updated last month
- Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026)☆53Feb 23, 2026Updated last week
- Memory Topology for GPUs☆17Feb 13, 2026Updated 2 weeks ago
- ext_mpi_collectives☆11Apr 1, 2025Updated 11 months ago
- Helper package to spin-up a Qdrant instance without Docker☆13Dec 24, 2023Updated 2 years ago
- Tool to generate Android build system files (Android.mk, Android.bp) from APK automatically.☆10Nov 1, 2021Updated 4 years ago
- ☆10Updated this week
- Elmer/Ice course repository containing example cases and slide material.☆14Sep 29, 2025Updated 5 months ago
- A bash tool to create bootable images using Ansible, Docker and Dracut☆18Dec 16, 2024Updated last year
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- Integrated Global System Model☆11Apr 28, 2023Updated 2 years ago
- a software tool that facilitates the design of lightweight torsion springs☆17Sep 8, 2025Updated 5 months ago
- Single-Life Reinforcement Learning☆14Dec 17, 2022Updated 3 years ago
- ☆19Jun 28, 2025Updated 8 months ago
- HierCGRA: An Open-Source Framework for Large-Scale CGRA with Hierarchical Modeling and Automated Exploration☆14Mar 6, 2023Updated 2 years ago
- imageC / EVAnalyzer2 - High throughput biological image processor☆10Feb 13, 2026Updated 2 weeks ago
- Code for paper "Beyond Closure Models: Learning Chaotic Systems via Physics-Informed Neural Operators".☆14Dec 24, 2025Updated 2 months ago
- ☆10Oct 20, 2018Updated 7 years ago
- CANdle - a library for using USB-FDCAN dongle and communicating with md80 drives☆15Sep 15, 2025Updated 5 months ago
- GPU based 2D elastic FWI☆11Mar 6, 2018Updated 7 years ago
- ☆11Feb 27, 2024Updated 2 years ago
- EPOCH Input System Version 2☆10Jun 5, 2020Updated 5 years ago
- ☆10Jan 12, 2026Updated last month
- Program for adding, removing, editing and visualising events related to history☆10Feb 25, 2023Updated 3 years ago
- Port of syslinux to Mac OS X☆10Oct 2, 2019Updated 6 years ago
- [TPAMI-2018] A C++ framework for training/testing Support Vector Machine with Gaussian Sample Uncertainty (SVM-GSU).☆13Feb 20, 2018Updated 8 years ago