FlyAIBox / dcu-in-action
Domestic (Chinese-made) accelerator cards: Hygon DCU in action (large-model training, fine-tuning, inference, etc.)
☆29Updated this week
Alternatives and similar repositories for dcu-in-action
Users that are interested in dcu-in-action are comparing it to the libraries listed below
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆28Updated 2 months ago
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano☆32Updated 2 years ago
- Device-plugin for volcano vgpu which supports hard resource isolation☆91Updated last week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it a unified cache system with POSI…☆24Updated 6 months ago
- vLLM adapter for a TGIS-compatible gRPC server.☆32Updated this week
- A QA system based on k8s-specific knowledge, built on ChatGLM2-6B and served by Ray.☆10Updated last year
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆30Updated 6 months ago
- ☆54Updated 7 months ago
- A simple, High-Performance, Scalable ML/DL Models Repository based on OCI Artifacts☆33Updated last year
- Bitfusion with Kubernetes Integration Support☆50Updated last year
- A general-purpose GPU monitor, which can monitor GPU cards and the usage of each pod or container.☆20Updated 3 years ago
- d.run website☆16Updated this week
- This repository contains statistics about the AI Infrastructure products.☆18Updated 3 months ago
- A distributed engine for intelligent workload☆26Updated 4 months ago
- Backend server for envd☆21Updated last year
- A stress testing tool for the scheduler in a large-scale scenario.☆15Updated last year
- Distributed KV cache coordinator☆36Updated this week
- 🎉 An awesome & curated list of best LLMOps tools.☆122Updated last week
- An Envoy-inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆20Updated 2 months ago
- Device plugins for Volcano, e.g. GPU☆124Updated 3 months ago
- A Cloud-Native Service Catalog and Full Lifecycle Management Platform across Multi-cloud and Edge☆34Updated last year
- This project is designed to simulate GPU information, making it easier to test scenarios where a GPU is not available.☆47Updated 3 months ago
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆12Updated 3 weeks ago
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆67Updated last month
- A Kubernetes plugin that enables dynamically adding or removing GPU resources for a running Pod☆125Updated 3 years ago
- Blog☆21Updated last month
- Deploy ChatGLM on Modelz☆15Updated 2 years ago
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.☆17Updated 3 weeks ago
- Large language model fine-tuning capabilities based on cloud native and distributed computing.☆92Updated last year
- The Volcano Descheduler☆16Updated 5 months ago