☆30Sep 4, 2023Updated 2 years ago
Alternatives and similar repositories for oh-my-server
Users that are interested in oh-my-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Performance benchmarking with ColossalAI☆39Jul 6, 2022Updated 3 years ago
- Accuracy 77%. Large batch deep learning optimizer LARS for ImageNet with PyTorch and ResNet, using Horovod for distribution. Optional acc…☆38Jun 1, 2021Updated 5 years ago
- A Python library transfers PyTorch tensors between CPU and NVMe☆124Nov 27, 2024Updated last year
- ☆24Oct 14, 2022Updated 3 years ago
- Elixir: Train a Large Language Model on a Small GPU Cluster☆16Jun 8, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Examples of training models with hybrid parallelism using ColossalAI☆339Mar 23, 2023Updated 3 years ago
- PyTorch implementation of LAMB for ImageNet/ResNet-50 training☆13May 13, 2021Updated 5 years ago
- A collection of models built with ColossalAI☆33Nov 22, 2022Updated 3 years ago
- A Suite for Parallel Inference of Diffusion Transformers (DiTs) on multi-GPU Clusters☆58May 3, 2026Updated 2 months ago
- A curated list of awesome projects and papers for distributed training or inference☆279Oct 8, 2024Updated last year
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆19Nov 18, 2024Updated last year
- Scalable PaLM implementation of PyTorch☆190Dec 19, 2022Updated 3 years ago
- ☆14Apr 1, 2025Updated last year
- websocket-benchmark☆11May 27, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of nougat that focuses on processing pdf locally.☆85Jan 15, 2025Updated last year
- A baseline repository of Auto-Parallelism in Training Neural Networks☆145Jun 25, 2022Updated 4 years ago
- High Performance Grouped GEMM in PyTorch☆30May 10, 2022Updated 4 years ago
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- Torch Distributed Experimental☆117Aug 5, 2024Updated last year
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Feb 10, 2022Updated 4 years ago
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆17Feb 15, 2025Updated last year
- ☆16Nov 2, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- GPT Demo with hybrid distributed training☆10Dec 1, 2022Updated 3 years ago
- Sky Computing: Accelerating Geo-distributed Computing in Federated Learning☆90Nov 22, 2022Updated 3 years ago
- Give us minutes, we give back a faster Mamba. The official implementation of "Faster Vision Mamba is Rebuilt in Minutes via Merged Token …☆39Dec 18, 2024Updated last year
- A curated list for Efficient Large Language Models☆11Mar 25, 2024Updated 2 years ago
- Efficient Dataset Distillation by Representative Matching☆114Feb 28, 2024Updated 2 years ago
- An open-source implementation of Regional Adaptive Sampling (RAS), a novel diffusion model sampling strategy that introduces regional var…☆152Apr 10, 2026Updated 2 months ago
- A Translation Task using TurboTransformers☆10Dec 17, 2020Updated 5 years ago
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching☆108May 23, 2024Updated 2 years ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆29Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 面向多平台编译优化的深度学习中间表示☆10Oct 28, 2024Updated last year
- Efficient Auto-scalable Scientific Infrastructure for Engineers and Researchers☆15Sep 8, 2025Updated 9 months ago
- Function Plot plugin for Draw.io Desktop.☆21Apr 9, 2022Updated 4 years ago
- PiX: Dynamic Channel Sampling for ConvNets (CVPR 2024)☆13Jun 14, 2024Updated 2 years ago
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆346Sep 24, 2024Updated last year
- Pi-NAS: Improving Neural Architecture Search by Reducing Supernet Training Consistency Shift (ICCV 2021)☆20Nov 28, 2021Updated 4 years ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 10 months ago