nvwacloud / tensorlink
Unlock Unlimited Potential! Share Your GPU Power Across Your Local Network!
☆46 Updated 7 months ago
Alternatives and similar repositories for tensorlink:
Users that are interested in tensorlink are comparing it to the libraries listed below
- Implementation of a remote CUDA/OpenCL protocol ☆32 Updated 8 months ago
- LM inference server implementation based on *.cpp. ☆101 Updated this week
- GPUd automates monitoring, diagnostics, and issue identification for GPUs ☆278 Updated this week
- cricket is a virtualization solution for GPUs ☆181 Updated this week
- A Kubernetes plugin which enables dynamically adding or removing GPU resources for a running Pod ☆122 Updated 2 years ago
- Using CRDs to manage GPU resources in Kubernetes. ☆196 Updated 2 years ago
- Autoscale LLM inference (vLLM, SGLang, LMDeploy) on Kubernetes (and others) ☆251 Updated last year
- VQLite - Simple and Lightweight Vector Search Engine based on Google ScaNN ☆88 Updated 7 months ago
- ☆107 Updated 10 months ago
- HAMi-core compiles libvgpu.so, which enforces a hard limit on GPU in containers ☆132 Updated this week
- NVIDIA vGPU Device Manager manages NVIDIA vGPU devices on top of Kubernetes ☆126 Updated last week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it a unified cache system with POSI… ☆20 Updated 2 months ago
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work! ☆71 Updated this week
- Easier than K8s for raising and lowering the number of GPUs in a Docker container and scaling the capacity of a volume. ☆72 Updated 10 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving. ☆62 Updated 10 months ago
- Self-hosted huggingface mirror service. ☆122 Updated last week
- 🎉 An awesome & curated list of best LLMOps tools. ☆40 Updated last week
- Device plugin for Volcano vGPU which supports hard resource isolation ☆60 Updated this week
- Golang bindings for Nvidia Datacenter GPU Manager (DCGM) ☆103 Updated 2 weeks ago
- LessAPI-DuckDuckGo is an API service for a search engine. Simple, lightweight, reliable, Docker deployable, easy to maintain. Based on DuckDuc… ☆45 Updated 9 months ago
- NVIDIA NCCL Tests for Distributed Training ☆79 Updated this week
- HTTP-based, tree-shaped peer-to-peer blob transfer proxy for distributing images or blob data. ☆20 Updated 2 years ago
- The storage system of sealos, which aims to be a high-performance, high-reliability, and auto-scaling distributed file system ☆146 Updated last year
- A diverse, simple, and secure all-in-one LLMOps platform ☆99 Updated 5 months ago
- A huggingface mirror site. ☆264 Updated 11 months ago
- C++ implementation of Qwen-LM ☆577 Updated 2 months ago
- ☆61 Updated this week
- Hooks CUDA-related dynamic libraries using automated code generation tools. ☆145 Updated last year
- A small language model that supports Chinese-language scenarios: llama2.c-zh ☆145 Updated 11 months ago
- ☆273 Updated last year