☆14Nov 7, 2025Updated 5 months ago
Alternatives and similar repositories for sc22-ae
Users who are interested in sc22-ae are comparing it to the libraries listed below.
- this is the release repository of superneurons☆54Feb 13, 2021Updated 5 years ago
- RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025]☆14Dec 9, 2024Updated last year
- ☆31May 31, 2023Updated 2 years ago
- ☆15Jul 18, 2023Updated 2 years ago
- Experimental encrypted file system using SGX and FUSE☆12Oct 9, 2018Updated 7 years ago
- ☆14Jun 4, 2024Updated last year
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…☆27Dec 10, 2022Updated 3 years ago
- The (open-source part of) code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm".☆13Jun 7, 2021Updated 4 years ago
- ☆41Oct 11, 2025Updated 6 months ago
- ☆13Jan 28, 2026Updated 2 months ago
- Securing Data Analytics on Intel SGX using Randomization☆13Aug 30, 2017Updated 8 years ago
- ☆41Nov 28, 2022Updated 3 years ago
- PyTorch-UVM on super-large language models.☆17Dec 21, 2020Updated 5 years ago
- What if everything is a io_uring?☆17Nov 10, 2022Updated 3 years ago
- ☆13Aug 18, 2025Updated 7 months ago
- An OS for trusted execution environments.☆12May 9, 2025Updated 11 months ago
- Clio, ASPLOS'22.☆79Feb 8, 2022Updated 4 years ago
- Python bindings for the PMDK. Non-volatile memory for Python.☆13Mar 22, 2023Updated 3 years ago
- ☆22Sep 9, 2024Updated last year
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆15Dec 21, 2020Updated 5 years ago
- Prompt format and padding guide for Llama 2☆12Sep 18, 2023Updated 2 years ago
- C++17 implementation of einops for libtorch - clear and reliable tensor manipulations with einstein-like notation☆11Oct 16, 2023Updated 2 years ago
- ☆89Apr 2, 2022Updated 4 years ago
- A low-cost, high-performance deep learning training framework that enables efficient 100B-scale model fine-tuning on a commodity server w…☆24Mar 21, 2025Updated last year
- [ISCA'25] LIA: A Single-GPU LLM Inference Acceleration with Cooperative AMX-Enabled CPU-GPU Computation and CXL Offloading☆12Jun 28, 2025Updated 9 months ago
- Benchmark PyTorch Custom Operators☆14Jul 6, 2023Updated 2 years ago
- ☆12Jun 10, 2023Updated 2 years ago
- [NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"☆11Nov 15, 2024Updated last year
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Mar 13, 2023Updated 3 years ago
- ☆16May 22, 2023Updated 2 years ago
- InstAttention: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference☆16Mar 30, 2025Updated last year
- λ-IO: a unified I/O stack for computational storage [FAST'23]☆79Apr 29, 2025Updated 11 months ago
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…☆10Aug 13, 2024Updated last year
- ☆10Mar 3, 2024Updated 2 years ago
- PetPS: Supporting Huge Embedding Models with Tiered Memory☆33May 21, 2024Updated last year
- ☆26Aug 19, 2022Updated 3 years ago
- A collection of awesome and useful resources for research.☆25Jun 5, 2025Updated 10 months ago
- A Python library transfers PyTorch tensors between CPU and NVMe☆125Nov 27, 2024Updated last year
- New IndexFS core☆26Mar 28, 2016Updated 10 years ago