zhuzilin/pytorch-malloc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhuzilin/pytorch-malloc)

zhuzilin / pytorch-malloc

An external memory allocator example for PyTorch.

☆16

Alternatives and similar repositories for pytorch-malloc

Users that are interested in pytorch-malloc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

feifeibear / PSTensor
View on GitHub
PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.
☆10Feb 10, 2022Updated 4 years ago
feifeibear / PyTorchMemTracer
View on GitHub
Depict GPU memory footprint during DNN training of PyTorch
☆11Nov 17, 2022Updated 3 years ago
megvii-research / IntLLaMA
View on GitHub
IntLLaMA: A fast and light quantization solution for LLaMA
☆19Jul 21, 2023Updated 3 years ago
zhuzilin / chatgpt-desktop
View on GitHub
Desktop version of ChatGPT, support manually set cookie
☆19Dec 9, 2022Updated 3 years ago
bytedance / QSync
View on GitHub
Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".
☆20Feb 23, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SymbioticLab / ModelKeeper
View on GitHub
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
☆36Jan 9, 2023Updated 3 years ago
xinjin / course-net-seminar
View on GitHub
Selected Topics in Computer Networks @ Johns Hopkins University
☆19Dec 17, 2020Updated 5 years ago
ryantd / veloce
View on GitHub
WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.
☆17Aug 4, 2022Updated 3 years ago
microsoft / SparTA
View on GitHub
☆167Jul 22, 2024Updated 2 years ago
amazon-science / FeatGraph
View on GitHub
☆69Jun 16, 2021Updated 5 years ago
vllm-project / vllm-nccl
View on GitHub
Manages vllm-nccl dependency
☆18Jun 3, 2024Updated 2 years ago
jack-willturner / nas-as-program-transformation-exploration
View on GitHub
The code for our paper "Neural Architecture Search as Program Transformation Exploration"
☆17Apr 28, 2021Updated 5 years ago
hpcaitech / Elixir
View on GitHub
Elixir: Train a Large Language Model on a Small GPU Cluster
☆16Jun 8, 2023Updated 3 years ago
S-Lab-System-Group / Hydro
View on GitHub
Surrogate-based Hyperparameter Tuning System
☆30Jun 29, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
SwarmArch / T4
View on GitHub
Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"
☆29Feb 18, 2022Updated 4 years ago
LeiWang1999 / TVM.CMakeExtend
View on GitHub
Tutorials of Extending and importing TVM with CMAKE Include dependency.
☆16Oct 11, 2024Updated last year
Adlik / model_zoo
View on GitHub
☆11Dec 26, 2025Updated 6 months ago
tlkh / depsep-conv-benchmarks
View on GitHub
Code for Depth-wise Separable Convolutions: Performance Investigations
☆19Jan 28, 2020Updated 6 years ago
TiledTensor / TiledLower
View on GitHub
TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.
☆13Nov 23, 2024Updated last year
AlibabaPAI / torchacc
View on GitHub
PyTorch distributed training acceleration framework
☆56Aug 13, 2025Updated 11 months ago
Oneflow-Inc / oneflow-documentation
View on GitHub
oneflow documentation
☆69Jun 26, 2024Updated 2 years ago
mayJJ / ket
View on GitHub
☆12Aug 5, 2018Updated 7 years ago
alibaba / GPU-scheduler-for-deep-learning
View on GitHub
GPU-scheduler-for-deep-learning
☆214Nov 5, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
vllm-project / tml-fa4
View on GitHub
FA4-based Relative Attention Kernel developed by TML and Colfax
☆17Updated this week
thu-pacman / PET
View on GitHub
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
☆126Jun 23, 2022Updated 4 years ago
Azure / msccl-executor-nccl
View on GitHub
☆47Dec 13, 2024Updated last year
lmbxmu / CLR-RNF
View on GitHub
Pytorch implementation of our paper (TNNLS) -- Pruning Networks with Cross-Layer Ranking & k-Reciprocal Nearest Filters
☆12Feb 24, 2022Updated 4 years ago
carefree0910 / carefree-flow
View on GitHub
Deep Learning ❤️ OneFlow
☆19Aug 26, 2021Updated 4 years ago
Oneflow-Inc / oneflow-lite
View on GitHub
☆17Jan 1, 2024Updated 2 years ago
restran / wiz-search
View on GitHub
✏️ Offline Full Text Search for Wiz Note Mac Client
☆10May 15, 2019Updated 7 years ago
zhuohan123 / terapipe
View on GitHub
☆79May 4, 2021Updated 5 years ago
jgoeders / dac_sdc_2021
View on GitHub
☆23Oct 7, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
MPSLab-ASU / dMazeRunner
View on GitHub
dMazeRunner: Dataflow acceleration optimization infrastructure for coarse-grained programmable accelerators
☆48Apr 4, 2022Updated 4 years ago
zdaiot / wiznote2hexo2csdn
View on GitHub
为知笔记markdown转为hexo博客markdown，hexo博客markdown转外链图片的markdown(可直接复制到csdn、简书等)
☆10Oct 29, 2019Updated 6 years ago
spcl / substation
View on GitHub
Research and development for optimizing transformers
☆132Feb 16, 2021Updated 5 years ago
sjtu-epcc / DVABatch
View on GitHub
☆21May 13, 2022Updated 4 years ago
xdit-project / DiTCacheAnalysis
View on GitHub
An auxiliary project analysis of the characteristics of KV in DiT Attention.
☆34Nov 29, 2024Updated last year
plasma-umass / DoubleTake
View on GitHub
Evidence-based dynamic analysis: a fast checker for memory errors.
☆21Apr 22, 2017Updated 9 years ago
ucasligang / awesome-VisonTransformers
View on GitHub
Reading list for research topics in Vison Transformers
☆17Aug 31, 2022Updated 3 years ago