ghostplant / tf-image-generator
Extremely fast TensorFlow native ops that replace tf.keras.ImageDataGenerator for feeding image data to a GPU device (simpler than tf.dataset.TFRecord)
☆7Updated 7 months ago
Alternatives and similar repositories for tf-image-generator
Users that are interested in tf-image-generator are comparing it to the libraries listed below
- Artifact of OSDI '24 paper, "Llumnix: Dynamic Scheduling for Large Language Model Serving"☆62Updated last year
- Microsoft Collective Communication Library☆354Updated last year
- Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation☆27Updated last month
- DeepSeek-V3/R1 inference performance simulator☆164Updated 4 months ago
- ☆192Updated 5 years ago
- Repository for MLCommons Chakra schema and tools☆117Updated last week
- Artifacts for our NSDI'23 paper TGS☆81Updated last year
- NCCL Profiling Kit☆141Updated last year
- An experimental parallel training platform☆54Updated last year
- Fine-grained GPU sharing primitives☆143Updated 2 weeks ago
- Compiler for Dynamic Neural Networks☆46Updated last year
- An interference-aware scheduler for fine-grained GPU sharing☆143Updated 6 months ago
- This repository is an archive. Refer to https://github.com/gvirtus/GVirtuS☆45Updated 3 years ago
- Helios Traces from SenseTime☆56Updated 2 years ago
- Code for "Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020☆127Updated last year
- Ultra and Unified CCL☆468Updated this week
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems☆191Updated 2 weeks ago
- Paella: Low-latency Model Serving with Virtualized GPU Scheduling☆60Updated last year
- MSCCL++: A GPU-driven communication stack for scalable AI applications☆394Updated this week
- Intercepting CUDA runtime calls with LD_PRELOAD☆41Updated 11 years ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)☆84Updated 2 years ago
- ☆18Updated 2 years ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆120Updated last year
- Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO☆31Updated last year
- The source code of INFless, a native serverless platform for AI inference.☆40Updated 2 years ago
- Microsoft Collective Communication Library☆65Updated 8 months ago
- Integrated Training Platform (ITP) traces used in ElasticFlow paper.☆29Updated 2 years ago
- High performance Transformer implementation in C++.☆129Updated 6 months ago
- RDMA and SHARP plugins for the NCCL library☆200Updated last month
- Thunder Research Group's Collective Communication Library☆39Updated last month