SYCL implementation of Fused MLPs for Intel GPUs
☆51Jun 11, 2026Updated this week
Alternatives and similar repositories for tiny-dpcpp-nn
Users that are interested in tiny-dpcpp-nn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆65May 27, 2026Updated 2 weeks ago
- ☆63Dec 18, 2024Updated last year
- C++ pipeline with OpenVINO native API for Stable Diffusion v1.5☆13Feb 23, 2024Updated 2 years ago
- ☆24May 26, 2026Updated 2 weeks ago
- A simulation framework for modeling efficiency of Graph Neural Network Dataflows☆24Feb 14, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Ribbon Menu for React☆20Feb 23, 2024Updated 2 years ago
- Code for paper "Beyond Closure Models: Learning Chaotic Systems via Physics-Informed Neural Operators".☆16Dec 24, 2025Updated 5 months ago
- ☆20Jan 17, 2024Updated 2 years ago
- Benchmarks of different devices I have come across☆43Aug 28, 2025Updated 9 months ago
- ☆24Apr 15, 2026Updated last month
- Reader for CalculiX .dat files☆11May 12, 2025Updated last year
- Enhancing the convergence speed by 2x and improving the training success of Physics-Informed Neural Networks (PINNs).☆13Oct 14, 2024Updated last year
- Super fast FP32 matrix multiplication on RDNA3☆90Mar 30, 2025Updated last year
- Active learning of extreme events using deep neural operators.☆16Nov 10, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Zig regex experiment☆13Nov 6, 2025Updated 7 months ago
- Official Implementation of "AIVT: Inference of turbulent thermal convection from measured 3D velocity data by physics-informed Kolmogorov…☆16Oct 10, 2025Updated 8 months ago
- JIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal☆13Updated this week
- DeskVOX is a real-time visualization tool for 3D data sets like image stacks from CT or MRI scanners, or confocal microscopes. It has an …☆21Jun 3, 2026Updated last week
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆22Mar 23, 2026Updated 2 months ago
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆77Updated this week
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆10Feb 26, 2025Updated last year
- ☆18Aug 9, 2023Updated 2 years ago
- ROCm Documentation Python package for ReadTheDocs build standardization☆15Jun 7, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆268May 13, 2026Updated last month
- Julian macros for wrapping ccall☆14May 27, 2021Updated 5 years ago
- ☆19Nov 6, 2024Updated last year
- Make std::mdspan formattable by std::format.☆11Dec 25, 2023Updated 2 years ago
- A tool allowing students of Coursera's Heterogeneous Parallel Programming to work on homework using a machine without a CUDA GPU.☆11Mar 11, 2015Updated 11 years ago
- Ansible Role - Easy and flexible dotfile installation with stow.☆11Oct 6, 2023Updated 2 years ago
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…☆50Oct 23, 2023Updated 2 years ago
- Weight Initialization Schemes for Deep Learning Frameworks☆10Nov 4, 2024Updated last year
- Translation layer from ANARI to OSPRay, ANARILibrary and ANARIDevice "ospray".☆21Apr 3, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆95Updated this week
- ☆13Updated this week
- Proof of concept for type system with unions, intersections and complements.☆14Apr 21, 2023Updated 3 years ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆136Apr 10, 2026Updated 2 months ago
- Code and experiments for the NeurIPS 2023 paper Stabilized Neural Differential Equations for Learning Dynamics with Explicit Constraints☆12Mar 26, 2024Updated 2 years ago
- ☆20Mar 27, 2023Updated 3 years ago
- ☆18May 6, 2026Updated last month