SYCL implementation of Fused MLPs for Intel GPUs
☆51 · Jun 11, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for tiny-dpcpp-nn
Users that are interested in tiny-dpcpp-nn are comparing it to the libraries listed below.
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU (XPU) devices. Note… ☆65 · May 27, 2026 · Updated last month
- ☆62 · Dec 18, 2024 · Updated last year
- ☆24 · May 26, 2026 · Updated last month
- Code for paper "Beyond Closure Models: Learning Chaotic Systems via Physics-Informed Neural Operators". ☆16 · Dec 24, 2025 · Updated 6 months ago
- ☆20 · Jan 17, 2024 · Updated 2 years ago
- Benchmarks of different devices I have come across ☆43 · Jun 20, 2026 · Updated 2 weeks ago
- CPU and GPU tutorial examples ☆13 · Apr 4, 2025 · Updated last year
- ☆117 · May 10, 2026 · Updated last month
- Reader for CalculiX .dat files ☆11 · May 12, 2025 · Updated last year
- Wave: Python Domain-Specific Language for High Performance Machine Learning ☆58 · Jun 8, 2026 · Updated 3 weeks ago
- Enhancing the convergence speed by 2x and improving the training success of Physics-Informed Neural Networks (PINNs). ☆13 · Oct 14, 2024 · Updated last year
- Super fast FP32 matrix multiplication on RDNA3 ☆92 · Mar 30, 2025 · Updated last year
- Active learning of extreme events using deep neural operators. ☆16 · Nov 10, 2022 · Updated 3 years ago
- Sub-module for OpenFOAM that provides a solver for embedding SmartSim and its external dependencies (i.e. SmartRedis) into OpenFOAM. ☆45 · Sep 10, 2025 · Updated 9 months ago
- Official Implementation of "AIVT: Inference of turbulent thermal convection from measured 3D velocity data by physics-informed Kolmogorov… ☆16 · Oct 10, 2025 · Updated 8 months ago
- JIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal ☆13 · Jun 10, 2026 · Updated 3 weeks ago
- DeskVOX is a real-time visualization tool for 3D data sets like image stacks from CT or MRI scanners, or confocal microscopes. It has an … ☆21 · Jun 3, 2026 · Updated last month
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data. ☆22 · Mar 23, 2026 · Updated 3 months ago
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs ☆76 · Jun 26, 2026 · Updated last week
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs. ☆11 · Feb 26, 2025 · Updated last year
- Scalable data movement in Exascale Supercomputers ☆19 · Mar 30, 2026 · Updated 3 months ago
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi… ☆271 · Jun 25, 2026 · Updated last week
- Julian macros for wrapping ccall ☆14 · May 27, 2021 · Updated 5 years ago
- ☆14 · Aug 22, 2025 · Updated 10 months ago
- Make std::mdspan formattable by std::format. ☆11 · Dec 25, 2023 · Updated 2 years ago
- ☆19 · Nov 2, 2025 · Updated 8 months ago
- Bundle Julia projects ☆13 · Jan 20, 2025 · Updated last year
- Playing OpenAI games with Neuroevolution ☆11 · Nov 16, 2019 · Updated 6 years ago
- Commands that will make you more comfortable with the ROCm toolkit. ☆18 · Aug 1, 2024 · Updated last year
- Translation layer from ANARI to OSPRay, ANARILibrary and ANARIDevice "ospray". ☆22 · Jun 25, 2026 · Updated last week
- RC7 Scripts for Roblox. ☆24 · Dec 17, 2016 · Updated 9 years ago
- ☆97 · Updated this week
- Code and experiments for the NeurIPS 2023 paper "Stabilized Neural Differential Equations for Learning Dynamics with Explicit Constraints" ☆12 · Mar 26, 2024 · Updated 2 years ago
- A list of OpenStack Security Best Practices - written in Markdown ☆10 · Apr 8, 2015 · Updated 11 years ago
- Implementation of "ConvMixer: Patches Are All You Need?" in TensorFlow and Keras ☆12 · Oct 31, 2021 · Updated 4 years ago
- SYCL 2020 course from 小彭老师 (under construction; to be released later in live streams) ☆15 · Sep 3, 2023 · Updated 2 years ago
- The vLLM XPU kernels for Intel GPU ☆50 · Jun 24, 2026 · Updated last week
- Open-source pipeline to create augmented reality (AR) models from scientific data ☆25 · Aug 7, 2023 · Updated 2 years ago
- Collection of scripts to build PyTorch and the domain libraries from source. ☆14 · Jun 9, 2026 · Updated 3 weeks ago