Optimize an example model with Python, C++, and CUDA extensions and Ring-Allreduce.
☆110Dec 25, 2018Updated 7 years ago
Alternatives and similar repositories for pytorch-parallel
Users that are interested in pytorch-parallel are comparing it to the libraries listed below.
- C++ extensions in PyTorch☆1,185Jan 13, 2026Updated 3 months ago
- A quickstart and benchmark for pytorch distributed training.☆1,661Jul 25, 2024Updated last year
- CRNN_CTC_PyTorch☆10Oct 17, 2019Updated 6 years ago
- MG-WFBP: Merging Gradients Wisely for Efficient Communication in Distributed Deep Learning☆12Apr 26, 2021Updated 5 years ago
- an example of a CUDA extension for PyTorch using CuPy which computes the Hadamard product of two tensors☆119May 26, 2025Updated 11 months ago
- An example of C++ extension for PyTorch.☆37Sep 24, 2019Updated 6 years ago
- Paper List for In-context Learning 🌷☆19Jan 3, 2023Updated 3 years ago
- This is a BNN_Kernel on PyTorch for 1-bit networks in image data processing☆23Sep 28, 2019Updated 6 years ago
- ☆17Mar 28, 2022Updated 4 years ago
- ☆15Oct 23, 2018Updated 7 years ago
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,532Apr 29, 2021Updated 5 years ago
- ☆130Nov 16, 2020Updated 5 years ago
- deformable_conv2d layer implemented in pytorch☆63Mar 4, 2019Updated 7 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- A simple middleware to improve GPU utilization and speed up online inference.☆19Feb 22, 2021Updated 5 years ago
- 'Bi-directional Relationship Inferring Network for Referring Image Segmentation' CVPR2020☆18Apr 2, 2022Updated 4 years ago
- [CVPR 2023] Official implementation of "Deep Dive into Gradients: Better Optimization for 3D Object Detection with Gradient-Corrected IoU…☆20Jun 9, 2023Updated 2 years ago
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆59Nov 28, 2022Updated 3 years ago
- Official code for "Writing Distributed Applications with PyTorch", PyTorch Tutorial☆266Dec 12, 2022Updated 3 years ago
- ddl-benchmarks: Benchmarks for Distributed Deep Learning☆36May 29, 2020Updated 5 years ago
- Sharing daily computer vision papers from arXiv☆705Aug 17, 2019Updated 6 years ago
- This project provides a face recognition system via opencv4☆18Jan 16, 2019Updated 7 years ago
- "Layer-wise Adaptive Rate Scaling" in PyTorch☆87Jan 22, 2021Updated 5 years ago
- Official pytorch implementation of DeformSyncNet: Deformation Transfer via Synchronized Shape Deformation Spaces☆24Dec 10, 2020Updated 5 years ago
- Useful CUDA code.☆43Mar 11, 2022Updated 4 years ago
- Awesome Few-shot learning☆53Jan 3, 2020Updated 6 years ago
- Official Pytorch implementation of "DBS: Dynamic Batch Size for Distributed Deep Neural Network Training"☆23Sep 30, 2021Updated 4 years ago
- ☆10Apr 10, 2019Updated 7 years ago
- ☆87Dec 16, 2020Updated 5 years ago
- Package speculatively provides a simple mechanism to re-execute a task in parallel only after some initial timeout has elapsed.☆10Jul 11, 2025Updated 9 months ago
- Deformable ConvNets V2 (DCNv2) in PyTorch☆1,487Nov 18, 2022Updated 3 years ago
- CPM R-CNN: Calibrating Point-guided Misalignment in Object Detection☆10Nov 2, 2020Updated 5 years ago
- ☆30Sep 4, 2023Updated 2 years ago
- pytorch memory track code☆1,015May 4, 2021Updated 4 years ago
- SOLO: Segmenting Objects by Locations☆164Dec 20, 2021Updated 4 years ago
- [SIGGRAPH Asia 2025] Official github repo of SeqTex, an end-to-end 3D texture generation method using video diffusion priors.☆44Dec 12, 2025Updated 4 months ago
- Code for the paper "CompoNet: Learning to Generate the Unseen by Part Synthesis and Composition"☆28Oct 27, 2019Updated 6 years ago
- Code for "TSGCNeXt: Dynamic-Static Multi-Graph Convolution for Efficient Skeleton-Based Action Recognition with Long-term Learning Potent…☆38Sep 12, 2025Updated 7 months ago