Drop-in library for tracking the memory allocations of CUDA applications
☆14 · Nov 17, 2017 · Updated 8 years ago
Alternatives and similar repositories for cuda-malloc-hook
Users who are interested in cuda-malloc-hook are comparing it to the libraries listed below.
- An external memory allocator example for PyTorch. ☆16 · Aug 10, 2025 · Updated 6 months ago
- Fine-grained GPU sharing primitives ☆148 · Jul 28, 2025 · Updated 7 months ago
- GitBucket Docker Image ☆10 · Jul 17, 2024 · Updated last year
- Virtual Audio Loopback Cable for Windows ☆10 · Sep 18, 2022 · Updated 3 years ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class. ☆10 · Feb 10, 2022 · Updated 4 years ago
- Tool to display/decode CPUINFO ☆10 · Oct 22, 2018 · Updated 7 years ago
- Hanzi to Pinyin engine in Swift (Pinyin input method engine) ☆14 · Mar 29, 2024 · Updated last year
- A C++ implementation of stft, melspectrogram and mel_to_stft ☆10 · Jun 2, 2022 · Updated 3 years ago
- FalkorDB port to Rust ☆12 · Jul 29, 2025 · Updated 7 months ago
- Exposes batch message receives (recvmmsg) ☆14 · Aug 15, 2025 · Updated 6 months ago
- Simple, optimized, embedded, persistent (file-based) key-value cache ☆13 · Mar 30, 2023 · Updated 2 years ago
- Backprop with Low-Precision Activations ☆11 · Oct 28, 2019 · Updated 6 years ago
- Mount HFS (HFS, HFS+, HFSX) USB/SD on iOS without iFile ☆11 · Sep 9, 2015 · Updated 10 years ago
- GPT Demo with hybrid distributed training ☆10 · Dec 1, 2022 · Updated 3 years ago
- Dockerfile for building remix-ide docker image ☆10 · Jan 17, 2020 · Updated 6 years ago
- Hikaru no Go, GBA translation ☆13 · Sep 15, 2017 · Updated 8 years ago
- Use LD_PRELOAD to redirect socket ports or unix domain socket paths ☆13 · Sep 29, 2020 · Updated 5 years ago
- A simple demonstration of running the TensorFlow Inception v3 model on TensorRT ☆12 · Jun 5, 2018 · Updated 7 years ago
- Distributed Embeddable Database ☆12 · Sep 25, 2020 · Updated 5 years ago
- Calculate SHA256 checksums of objects on Amazon S3. ☆11 · Sep 6, 2024 · Updated last year
- OpenGL interop example using WGL_NV_DX_interop2 ☆10 · Mar 8, 2018 · Updated 7 years ago
- Intercepting CUDA runtime calls with LD_PRELOAD ☆43 · Mar 11, 2014 · Updated 11 years ago
- C++ "spatial" logging system which targets versatile, (de)clutchable, _debugging_, in a single header. ☆17 · Oct 1, 2024 · Updated last year
- A simple file server written in Go. Allows files to be uploaded, downloaded, or deleted. ☆10 · Sep 28, 2025 · Updated 5 months ago
- A decentralised application that creates high quality machine learning datasets ☆13 · Jan 22, 2019 · Updated 7 years ago
- Research & Development for Golem project ☆21 · Dec 10, 2018 · Updated 7 years ago
- A Translation Task using TurboTransformers ☆11 · Dec 17, 2020 · Updated 5 years ago
- Simple snake game coded in Python, using PyQt ☆13 · Dec 9, 2019 · Updated 6 years ago
- Kingul: Korean Keyboard for Kindle E-readers ☆14 · Jan 26, 2025 · Updated last year
- Setting up a private network drive for a lab ☆10 · Jul 29, 2019 · Updated 6 years ago
- Docker Volume Plugin for CephFS ☆13 · Nov 27, 2019 · Updated 6 years ago
- TensorFlow implementation of AKO, a decentralized approach to distributed deep learning. ☆10 · Aug 9, 2018 · Updated 7 years ago
- User-Level Online Mobile Computation Offloading Framework ☆11 · Jan 5, 2019 · Updated 7 years ago
- Depicts the GPU memory footprint during DNN training in PyTorch ☆11 · Nov 17, 2022 · Updated 3 years ago
- Layer-wise Sparsification of Distributed Deep Learning ☆10 · Jul 6, 2020 · Updated 5 years ago
- Django + Vue small memo project with a decoupled front end and back end ☆13 · Dec 11, 2022 · Updated 3 years ago
- Automated neural architecture search algorithms implemented in PyTorch and Autogluon toolkit. ☆12 · Apr 17, 2020 · Updated 5 years ago
- DoraNetwork BlockChain Core Project (Dora Network) ☆15 · Nov 9, 2018 · Updated 7 years ago
- Review materials for internet-company campus recruiting ☆10 · May 15, 2019 · Updated 6 years ago