google / genc
☆61Updated 4 months ago
Alternatives and similar repositories for genc:
Users who are interested in genc are comparing it to the libraries listed below.
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆51Updated this week
- A minimalistic C++ Jinja templating engine for LLM chat templates☆128Updated this week
- ☆218Updated last month
- xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆107Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆181Updated this week
- Generative AI Language (PaLM2 + Langchain) Workshop sample codes☆72Updated 11 months ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆301Updated this week
- ☆137Updated this week
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆155Updated 3 weeks ago
- Google TPU optimizations for transformers models☆104Updated 2 months ago
- Hugging Face Deep Learning Containers (DLCs) for Google Cloud☆143Updated 2 months ago
- Large Language Model Text Generation Inference on Habana Gaudi☆32Updated 2 weeks ago
- ☆30Updated 3 months ago
- ☆27Updated last week
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆40Updated this week
- ScalarLM - a unified training and inference stack☆31Updated last week
- Testing framework for Deep Learning models (TensorFlow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆64Updated 4 months ago
- ☆141Updated last year
- ☆26Updated last week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆87Updated this week
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆108Updated 3 weeks ago
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …☆41Updated last week
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference☆55Updated last week
- Machine learning for machine code.☆84Updated this week
- Repository of model demos using TT-Buda☆63Updated this week
- ☆176Updated this week
- Notes and artifacts from the ONNX steering committee☆25Updated this week
- Transformer GPU VRAM estimator☆58Updated last year
- Slides and recordings of talks hosted by our community☆20Updated 9 months ago
- ☆38Updated this week