A storage solution for PyTorch tensors with distributed tensor support.
☆80 · May 29, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for torchstore
Users that are interested in torchstore are comparing it to the libraries listed below.
- PyTorch Single Controller ☆1,042 · Updated this week
- An experimental implementation of compiler-driven automatic sharding of models across a given device mesh. ☆84 · Updated this week
- Creating Generative AI Apps which work ☆17 · Apr 14, 2025 · Updated last year
- paper and code for New Directions in Cloud Programming, CIDR 2021 ☆11 · Feb 17, 2021 · Updated 5 years ago
- Mirror of Plan 9 4th Edition from p9f ☆14 · Mar 23, 2021 · Updated 5 years ago
- A std::execution style runtime context and High Performance RPC Transport for using OpenUCX. Including CUDA/ROCM/... devices with RDMA. ☆33 · May 26, 2026 · Updated 2 weeks ago
- CookingZoo: a gym-cooking derivative to simulate a complex cooking environment ☆22 · Dec 6, 2024 · Updated last year
- LoRAFusion: Efficient LoRA Fine-Tuning for LLMs ☆27 · Apr 8, 2026 · Updated 2 months ago
- Agentic Graph RAG: Skeleton Indexing + VectorCypher + Agentic Router with Self-Correction ☆37 · Feb 26, 2026 · Updated 3 months ago
- The mgmt translator for Puppet manifests ☆11 · Feb 27, 2024 · Updated 2 years ago
- Exploring how optimizations for GEMMs work ☆36 · Feb 28, 2026 · Updated 3 months ago
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates. ☆419 · Updated this week
- eco4cast library aims to reduce carbon footprint of machine learning models with predictive cloud computing scheduling ☆16 · Aug 26, 2024 · Updated last year
- For building the world's largest dataset of GPU kernels. ☆10 · Jun 5, 2026 · Updated last week
- a collection of skills for vllm-omni ☆74 · Updated this week
- System for queue detection and control ☆10 · Dec 12, 2020 · Updated 5 years ago
- Development containers for triton and triton-cpu ☆28 · Jun 3, 2026 · Updated last week
- Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts ☆122 · Oct 17, 2024 · Updated last year
- PyTorch bindings for CUTLASS grouped GEMM. ☆190 · Apr 8, 2026 · Updated 2 months ago
- ☆18 · Nov 5, 2023 · Updated 2 years ago
- ☆11 · Feb 2, 2018 · Updated 8 years ago
- Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance. ☆358 · Updated this week
- A toolkit for scaling law research ⚖ ☆64 · Jan 27, 2025 · Updated last year
- USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference ☆672 · May 21, 2026 · Updated 3 weeks ago
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU) ☆64 · May 5, 2026 · Updated last month
- Sparse Backpropagation for Mixture-of-Expert Training ☆30 · Jul 2, 2024 · Updated last year
- ☆10 · Sep 20, 2018 · Updated 7 years ago
- Holistic job manager on Kubernetes ☆117 · Feb 20, 2024 · Updated 2 years ago
- Torch Distributed Experimental ☆117 · Aug 5, 2024 · Updated last year
- Training camp project for the training track ☆27 · Jan 28, 2026 · Updated 4 months ago
- Implementation of a Tensorflow XLA rematerialization pass ☆15 · Dec 20, 2019 · Updated 6 years ago
- API for coordinating Maintenance in Kubernetes. ☆26 · Jul 18, 2025 · Updated 10 months ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags" ☆27 · Dec 6, 2013 · Updated 12 years ago
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing ☆30 · Nov 27, 2024 · Updated last year
- "Generating Music Medleys via Music Puzzle Games", AAAI 2018 ☆19 · Nov 6, 2018 · Updated 7 years ago
- Scalable toolkit for efficient model alignment ☆851 · Oct 6, 2025 · Updated 8 months ago
- ☆11 · Feb 22, 2022 · Updated 4 years ago
- Studying GPU Multi-tenancy ☆11 · Jan 11, 2019 · Updated 7 years ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference ☆82 · Dec 18, 2025 · Updated 5 months ago