DataTunerX / datatunerxLinks
Large language model fine-tuning capabilities based on cloud native and distributed computing.
☆91Updated last year
Alternatives and similar repositories for datatunerx
Users that are interested in datatunerx are comparing it to the libraries listed below
Sorting:
- Federated middleware based on Karmada☆48Updated last year
- Using CRDs to manage GPU resources in Kubernetes.☆209Updated 2 years ago
- Device plugins for Volcano, e.g. GPU☆128Updated 5 months ago
- A federation scheduler for multi-cluster☆51Updated 2 months ago
- Device-plugin for volcano vgpu which support hard resource isolation☆102Updated 2 months ago
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆246Updated this week
- Ferry is a Kubernetes multi-cluster communication component that eliminates communication differences between clusters as if they were in…☆104Updated 2 years ago
- Kubernetes LTS(long term support)☆217Updated 8 months ago
- HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container☆210Updated last week
- Underlay and RDMA network solution of the Kubernetes, for bare metal, VM and any public cloud☆604Updated this week
- ☆121Updated 2 years ago
- The API (CRD) of Volcano☆44Updated last month
- A Cloud-Native Service Catalog and Full Lifecycle Management Platform accross Multi-cloud and Edge☆32Updated last year
- OpenCIDN (Open Container Image Deliver Network)☆32Updated 2 months ago
- The limitless expansion of Kubernetes. Make Kubernetes without boundaries☆231Updated 2 months ago
- Kubeflow helm chart☆145Updated 2 years ago
- ZettaStor DBS provides enterprise-level business storage solutions with high availability, high performance, easy expansion and easy main…☆75Updated 8 months ago
- Fast is a Kubernetes CNI based on eBPF implementation☆36Updated last year
- MirageDebug: Local remote debugging for Kubernetes apps, enabling fully authentic environment debugging.☆56Updated last year
- data plane testing utility of cloud native☆223Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆561Updated 2 weeks ago
- d.run website☆15Updated last week
- Unified resource orchestration, unified scheduling, unified traffic management and unified telemetry for distributed cloud☆251Updated last month
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆530Updated last year
- Kubernetes Operator for AI and Bigdata Elastic Training☆88Updated 8 months ago
- helm repo add daocloud https://daocloud.github.io/dce-charts-repackage/☆12Updated last week
- ☆293Updated last week
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆32Updated last week
- katalyst aims to provide a universal solution to help improve resource utilization and optimize the overall costs in the cloud. This repo…☆47Updated last week
- ☆12Updated this week