☆19May 17, 2016Updated 10 years ago
Alternatives and similar repositories for EECS-368-Programming-Massively-Parallel-Processors-with-CUDA
Users that are interested in EECS-368-Programming-Massively-Parallel-Processors-with-CUDA are comparing it to the libraries listed below.
- Some CUDA projects and utilities☆27Nov 7, 2019Updated 6 years ago
- Radix sort analyses in parallel and serial ways.☆11Jan 21, 2016Updated 10 years ago
- A new QR decomposition algorithm implemented in CUDA☆18Jun 24, 2024Updated last year
- Erlang syslog logger☆27Jun 12, 2013Updated 13 years ago
- Integration of Tiramisu (Compiler) into PyTorch☆25May 27, 2020Updated 6 years ago
- Minimalist Redis Client for Erlang☆19Jul 15, 2013Updated 12 years ago
- Solution of Programming Massively Parallel Processors☆51Jan 15, 2024Updated 2 years ago
- ☆12Oct 22, 2019Updated 6 years ago
- A tool allowing students of Coursera's Heterogeneous Parallel Programming to work on homework using a machine without a CUDA GPU.☆11Mar 11, 2015Updated 11 years ago
- [ICML'25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…☆13Apr 17, 2025Updated last year
- Declarative MLIR compilers in Python!☆35Oct 9, 2020Updated 5 years ago
- CUDA by practice☆136Jan 7, 2020Updated 6 years ago
- ☆25Oct 9, 2025Updated 8 months ago
- CVFusion is an open-source deep learning compiler to fuse the OpenCV operators.☆33Aug 31, 2022Updated 3 years ago
- Odysseus: Playground of LLM Sequence Parallelism☆81Jun 17, 2024Updated 2 years ago
- Introduction to Artificial Intelligence course project: playing FlappyBird with reinforcement learning☆18Mar 25, 2020Updated 6 years ago
- This is a simple 2D convolution written in CUDA C that uses shared memory for better performance☆20Apr 12, 2018Updated 8 years ago
- A collection of awesome algorithms, implemented in CUDA.☆26Feb 6, 2018Updated 8 years ago
- HQEMU v2.5.1 is a retargetable and multi-threaded dynamic binary translator on multicores☆25Mar 21, 2018Updated 8 years ago
- Misc hacks on Kingston Mobile Wireless G2☆12Mar 2, 2018Updated 8 years ago
- Matrix Multiplication on GPU using Shared Memory considering Coalescing and Bank Conflicts☆26Aug 29, 2022Updated 3 years ago
- Examples and exercises from the book Programming Massively Parallel Processors - A Hands-on Approach. David B. Kirk and Wen-mei W. Hwu (T…☆79Jan 21, 2021Updated 5 years ago
- ☆14Jan 29, 2021Updated 5 years ago
- PwnPI, skip all the errors while installing PwnPI on your RPI3, just follow the README.☆13May 19, 2017Updated 9 years ago
- Building Machine Learning Infrastructure!☆45Jan 16, 2019Updated 7 years ago
- Distributed Field-aware Factorization Machines based on Parameter Server☆11Jan 5, 2018Updated 8 years ago
- Python scripts to facilitate easy working☆11Mar 23, 2026Updated 2 months ago
- Documentation on using the built-in Python debugger, PDB.☆24Dec 8, 2022Updated 3 years ago
- Reinforcement learning modular with pytorch☆11Jan 18, 2021Updated 5 years ago
- ☆14Jun 23, 2025Updated 11 months ago
- Some VxWorks fuzzing examples using Cisco-Kitty and WDBDbg framework☆19Mar 13, 2016Updated 10 years ago
- Companion service to Thomas Jacquin's excellent allsky cam (https://github.com/thomasjacquin/allsky). ClearSkyAlarm counts the stars in t…☆11Sep 13, 2022Updated 3 years ago
- A simple and ready to use akka http client for building HTTP requests and processing responses with custom response handlers☆11Dec 23, 2019Updated 6 years ago
- ☆17Sep 2, 2023Updated 2 years ago
- Open-source LoRa base station for Meshtastic/MeshCore. Raspberry Pi + SX1302/SX1303 concentrator: passive multi-channel capture, local da…☆170Jun 6, 2026Updated last week
- ☆11Apr 17, 2021Updated 5 years ago
- Guide I wrote mostly for myself on how to run mlc-llm on the Orange Pi 5 Pro☆25Aug 15, 2025Updated 10 months ago
- RDMA Optimization on MXNet☆14Nov 12, 2017Updated 8 years ago
- Python script that uses the scapy library to create and send pings of death.☆14Feb 11, 2021Updated 5 years ago