google / gencLinks
☆64Updated 5 months ago
Alternatives and similar repositories for genc
Users that are interested in genc are comparing it to the libraries listed below
Sorting:
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆114Updated this week
- ☆189Updated 2 years ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆404Updated last month
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆205Updated this week
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆162Updated this week
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆64Updated last month
- Hugging Face Deep Learning Containers (DLCs) for Google Cloud☆162Updated this week
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆57Updated this week
- ☆259Updated 2 months ago
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆202Updated 9 months ago
- Transformer GPU VRAM estimator☆68Updated last year
- ☆152Updated last month
- ☆31Updated 2 weeks ago
- Generative AI Language (PaLM2 + Langchain) Workshop sample codes☆78Updated last year
- Google TPU optimizations for transformers models☆135Updated 2 weeks ago
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…☆327Updated 4 months ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆79Updated last month
- Architectural Blueprints for RAG Automation: Advanced Document Understanding using Vertex AI Search☆13Updated last year
- Run GPU inference and training jobs on serverless infrastructure that scales with you.☆102Updated last year
- ☆85Updated 2 months ago
- 🦅🔗 Building FlyteGPT on Flyte with LangChain☆30Updated 2 years ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆82Updated 11 months ago
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 6 months ago
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆115Updated 6 months ago
- Command line tool for Deep Infra cloud ML inference service☆34Updated last year
- ☆72Updated last week
- ☆162Updated 11 months ago
- Benchmark suite for LLMs from Fireworks.ai☆89Updated last week
- Tutorial for building LLM router☆244Updated last year
- ☆67Updated 10 months ago