oneAPI - Data Parallel C++ course for students
☆44Nov 4, 2024Updated last year
Alternatives and similar repositories for oneAPI_course
Users that are interested in oneAPI_course are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆69Updated this week
- The repository contains a reference end-to-end pipeline for a real-time video analytics application. Realtime data is provided to an infe…☆12Nov 3, 2025Updated 4 months ago
- multithreads grep☆26May 12, 2019Updated 6 years ago
- Repository for CPU Kernel Generation for LLM Inference☆28Jul 13, 2023Updated 2 years ago
- vLLM Daily Summarization of Merged PRs☆48Updated this week
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆64Sep 18, 2025Updated 6 months ago
- ☆12Mar 28, 2023Updated 2 years ago
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet…☆12Feb 22, 2024Updated 2 years ago
- KFunca: A minimalist, high-performance GPU-based automatic differentiation framework☆29Aug 14, 2025Updated 7 months ago
- Documentation for TCP Lab☆12May 20, 2025Updated 10 months ago
- OpenAI Triton backend for Intel® GPUs☆236Updated this week
- ☆12May 25, 2021Updated 4 years ago
- Inference Llama 2 with a model compiled to native code by TorchInductor☆14Feb 8, 2024Updated 2 years ago
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- ☆24Updated this week
- Provides a tool to test CXL with Kernel and Qemu setup☆21Feb 20, 2026Updated last month
- Exercises for CppCon 2018 class on parallelism☆12Oct 10, 2019Updated 6 years ago
- PArallelLOOPgEneratoR: Threaded Loops Code Generation Infrastructure targeting Tensor Contraction Applications such as GEMMs, Convolution…☆19Jan 22, 2026Updated 2 months ago
- ☆60Dec 18, 2024Updated last year
- ☆22Jul 31, 2019Updated 6 years ago
- Pillow Performance Tests☆17Mar 2, 2026Updated 3 weeks ago
- Official implementation of "Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving"☆29May 8, 2025Updated 10 months ago
- Randomized algorithm class at CU☆15Jul 8, 2025Updated 8 months ago
- Intel® Tensor Processing Primitives extension for Pytorch*☆18Mar 3, 2026Updated 3 weeks ago
- ☆28Jan 7, 2023Updated 3 years ago
- Pytorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training"☆13Jun 7, 2023Updated 2 years ago
- Hardware Accelerated Pytorch Container with (also accelerated) ffmpeg & OpenCV 4☆24May 1, 2023Updated 2 years ago
- Simple implementation of Fuzzy C-means algorithm using python. It is used for soft clustering purpose. Visualizing the algorithm step by …☆16May 3, 2022Updated 3 years ago
- [NeurIPS 2022] "NSNet: A General Neural Probabilistic Framework for Satisfiability Problems"☆19Mar 29, 2023Updated 2 years ago
- parser script to process pytorch autograd profiler result, convert json file to excel.☆15Oct 8, 2019Updated 6 years ago
- ☆12Jul 6, 2022Updated 3 years ago
- ☆14Feb 2, 2021Updated 5 years ago
- Persistent Memory Tool Box☆12Mar 4, 2024Updated 2 years ago
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆17Aug 21, 2023Updated 2 years ago
- Mirror only see https://gitlab.rtems.org/rtems/pkg/rtems-libbsd☆35Mar 13, 2026Updated last week
- WRD 的 WebVPN 的 URL 互转原理🌚☆77May 26, 2024Updated last year
- Torch Frontend for IREE☆26Dec 21, 2023Updated 2 years ago
- Benchmarks to capture important workloads.☆32Updated this week
- ☆30Jan 24, 2026Updated 2 months ago