The (open-source part of) code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm".
☆13Jun 7, 2021Updated 4 years ago
Alternatives and similar repositories for BPPSA-open
Users who are interested in BPPSA-open are comparing it to the libraries listed below.
- A PyTorch wrapper of parallel exclusive scan in CUDA☆12May 25, 2023Updated 2 years ago
- SixArm.com » Brew install scripts for our various packages☆12Apr 14, 2025Updated 11 months ago
- ☆15Oct 5, 2014Updated 11 years ago
- Manage Multimodal Agentic Context Lifecycle with Lance☆62Mar 4, 2026Updated 3 weeks ago
- Stanford CS231n Convolutional Neural Networks for Visual Recognition Assignments☆11Aug 5, 2017Updated 8 years ago
- Alpine Linux [Docker]☆11Jan 11, 2026Updated 2 months ago
- ☆33Oct 20, 2025Updated 5 months ago
- ☆26Aug 21, 2022Updated 3 years ago
- Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"☆28Feb 18, 2022Updated 4 years ago
- Homework solutions to 2017 Fall Algorithm Courses in ShanghaiTech☆10Jan 5, 2018Updated 8 years ago
- Switch-based Training Acceleration for Machine Learning (SwitchML)☆16Apr 13, 2021Updated 4 years ago
- The AI-powered CLI Assistant☆30May 24, 2024Updated last year
- Atamai Image Registration and Segmentation☆21Mar 1, 2026Updated 3 weeks ago
- Cilk application benchmark programs☆11Aug 20, 2022Updated 3 years ago
- nnvm&tvm example of cross compilation and deployment in Nvidia Jetson TX2 platform☆11Apr 17, 2018Updated 7 years ago
- A new parallel algorithm for LZ77 compression based on suffix arrays☆22Apr 8, 2013Updated 12 years ago
- Community maintained hardware plugin for vLLM on AWS Neuron☆24Mar 19, 2026Updated last week
- the symbol description of mobilenet v2☆11Sep 7, 2018Updated 7 years ago
- Automatically instrument your app, capture all logs, traces, and requests, then let Cursor fix bugs with full context.☆47Sep 5, 2025Updated 6 months ago
- ☆14Nov 7, 2025Updated 4 months ago
- Compressive Read-mapping Accelerator☆14Sep 7, 2016Updated 9 years ago
- Homebrew formulas for installing LLM and related tools☆15Sep 6, 2023Updated 2 years ago
- modified cutlass☆15Oct 26, 2020Updated 5 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- The Amazon ECR Transfer Plugin for Data Transfer Hub(https://github.com/awslabs/data-transfer-hub). Transfer container images from Amazon…☆13Jan 29, 2025Updated last year
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- Artifact evaluation of the paper "Accelerating Training and Inference of Graph Neural Networks with Fast Sampling and Pipelining"☆23Mar 7, 2022Updated 4 years ago
- A JAX implementation of stochastic addition.☆14Aug 15, 2022Updated 3 years ago
- PilotFish harvests the free GPU cycles of cloud gaming with deep learning training☆14Jul 2, 2022Updated 3 years ago
- ☆44Nov 15, 2021Updated 4 years ago
- This repository corresponds to the PICCO compiler for secure multi-party computation published in 2013 with more recent efficiency improv…☆12Mar 12, 2026Updated 2 weeks ago
- ordspecsim: The Swarm architecture simulator☆24Feb 15, 2023Updated 3 years ago
- Tool for inferring cache replacement policies with automata learning. Uses LearnLib and Sketch.☆16Apr 21, 2020Updated 5 years ago
- Creating your own container in Linux.☆12Jan 10, 2019Updated 7 years ago
- Repository of materials used at the Kanazawa Artificial Intelligence study group and meetup, including presentation slides and Python programs☆12Oct 11, 2020Updated 5 years ago
- Deep Learning inference with AWS Lambda and Amazon EFS☆14Aug 24, 2020Updated 5 years ago
- A DMA Controller for RISCV CPUs☆13Aug 10, 2015Updated 10 years ago
- sample C++ code of googletest on Circle CI☆14Feb 27, 2018Updated 8 years ago
- Chinese NER using the pre-trained language model ALBERT☆12Jul 14, 2021Updated 4 years ago