roscisz / TensorHiveLinks
Tool for managing exclusive GPU access for distributed machine learning workloads
☆162Updated last year
Alternatives and similar repositories for TensorHive
Users that are interested in TensorHive are comparing it to the libraries listed below
Sorting:
- Python 3 Bindings for NVML library. Get NVIDIA GPU status inside your program.☆246Updated 3 years ago
- ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling …☆412Updated 2 months ago
- ClearML Agent - ML-Ops made easy. ML-Ops scheduler & orchestration solution☆267Updated 2 months ago
- Pytorch Lightning Distributed Accelerators using Ray☆210Updated last year
- ☆152Updated 2 years ago
- TF 2.x and PyTorch Lightning Callbacks for GPU monitoring☆92Updated 4 years ago
- Interactively retrieve data from sacred experiments.☆82Updated last month
- Python 3 Bindings for the NVIDIA Management Library☆139Updated 11 months ago
- Scheduling GPU cluster workloads with Slurm☆74Updated 6 years ago
- PyTorch dataset extended with map, cache etc. (tensorflow.data like)☆329Updated 2 years ago
- Hangar is version control for tensor data. Commit, branch, merge, revert, and collaborate in the data-defined software era.☆204Updated 4 years ago
- The spiritual successor to knockknock for PyTorch Lightning, get notified when your training ends☆77Updated 6 months ago
- Web-based dashboard for Sacred☆548Updated 2 years ago
- Management Dashboard for Torchserve☆122Updated 2 years ago
- Dashboard for sacred. Monitor and access your past machine learning experiments.☆184Updated 6 years ago
- PyTorch model training and layer saturation monitor☆81Updated 2 years ago
- Aggregate multiple tensorboard runs to new summary or csv files☆173Updated last week
- PyProf2: PyTorch Profiling tool☆82Updated 4 years ago
- Provide Python access to the NVML library for GPU diagnostics☆236Updated 6 months ago
- Simple tooling for marking deprecated functions or classes and re-routing to the new successors' instance.☆51Updated last month
- Lightweight interface to AWS☆47Updated 5 years ago
- MLCube® is a project that reduces friction for machine learning by ensuring that models are easily portable and reproducible.☆157Updated 8 months ago
- Deep Learning project template best practices with Pytorch Lightning, Hydra, Tensorboard.☆159Updated 4 years ago
- Train ImageNet in 18 minutes on AWS☆130Updated last year
- Deep Learning Benchmarking Suite☆129Updated 2 years ago
- Parameterized fit and prediction harnesses for pytorch☆40Updated 4 years ago
- A tool for enriching the output of nvidia-smi.☆564Updated last year
- Minimal implementation of adaptive gradient clipping (https://arxiv.org/abs/2102.06171) in TensorFlow 2.☆84Updated 3 years ago
- Using the famous cnn model in Pytorch, we run benchmarks on various gpu.☆235Updated 11 months ago
- A collection of code snippets for my PyTorch Lightning projects☆107Updated 4 years ago