dorlivne / PoPS
PoPS algorithm
☆15Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for PoPS
Users who are interested in PoPS are comparing it to the repositories listed below
- All Resources from Stanford CS106B 2021☆23Jul 11, 2025Updated 7 months ago
- CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization☆13Aug 3, 2024Updated last year
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA.☆29Updated this week
- A research project analyzing the system prompts of large language models☆71Oct 13, 2025Updated 4 months ago
- A super NFT platform based on FISCO BCOS / Vechain.☆11May 11, 2021Updated 4 years ago
- Implementing https://arxiv.org/abs/1612.02806☆13Sep 10, 2021Updated 4 years ago
- Multi-heap-sort for many small arrays, quicksort with 3 pivots for one big array, CUDA acceleration, CUDA memory compression.☆13Sep 29, 2024Updated last year
- ☆11Sep 21, 2022Updated 3 years ago
- Code for Q-learning with parametrized quantum circuits in OpenAI Gym environments.☆12Nov 12, 2021Updated 4 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆17Updated this week
- Stuy Pluggable AI☆11Oct 3, 2017Updated 8 years ago
- GEMV implementation with CUTLASS☆19Aug 21, 2025Updated 5 months ago
- ☆12Aug 31, 2023Updated 2 years ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Feb 8, 2026Updated last week
- ☆17Nov 22, 2025Updated 2 months ago
- Cute layout visualization☆30Jan 18, 2026Updated 3 weeks ago
- 。☆13Jan 15, 2022Updated 4 years ago
- ☆14Nov 3, 2025Updated 3 months ago
- a reactor network library☆16Aug 21, 2025Updated 5 months ago
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 5 months ago
- Companion code for 《汇编语言一发入魂》☆15May 30, 2020Updated 5 years ago
- Welcome to the GPU-FFT-Optimization repository! We present cutting-edge algorithms and implementations for optimizing the Fast Fourier Tr…☆20Dec 19, 2025Updated last month
- ☆15Mar 23, 2022Updated 3 years ago
- portFFT is a library implementing Fast Fourier Transforms using SYCL☆19Mar 1, 2025Updated 11 months ago
- Code implementation of "Information Design in Multi-Agent Reinforcement Learning"☆15Aug 18, 2023Updated 2 years ago
- ☆32Jul 2, 2025Updated 7 months ago
- auto grad in rust with video explanation.☆24Jun 19, 2025Updated 7 months ago
- To better understand the ggml library☆27Jun 13, 2025Updated 8 months ago
- Quantum Denoising Diffusion Models☆17Feb 8, 2024Updated 2 years ago
- ☆19Sep 23, 2020Updated 5 years ago
- [TMC'22] SplitPlace: AI Augmented Splitting and Placement of Large-Scale Neural Networks in Mobile Edge Environments☆21Dec 8, 2022Updated 3 years ago
- Implementation and optimization of matrix multiplication on single CPU (HPC-THU-2023-Autumn)☆18Feb 27, 2024Updated last year
- Implementation from scratch in CUDA C++ of image processing algorithms.☆21Oct 26, 2020Updated 5 years ago
- ToyLLM: Learning LLM from Scratch☆25Jan 26, 2026Updated 2 weeks ago
- ☆20Dec 29, 2024Updated last year
- ☆20Nov 3, 2024Updated last year
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆32Dec 21, 2024Updated last year
- ☆25Oct 10, 2024Updated last year
- Upper Confidence Tree Planner for ATARI games☆19Mar 9, 2016Updated 9 years ago