google / genc
☆59Updated 2 months ago
Alternatives and similar repositories for genc:
Users who are interested in genc are comparing it to the libraries listed below
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆274Updated this week
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆34Updated last week
- xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆103Updated this week
- A minimalistic C++ Jinja templating engine for LLM chat templates☆120Updated last week
- PyTorch per-step fault tolerance (actively under development)☆243Updated this week
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference☆51Updated last week
- 🦅🔗 Building FlyteGPT on Flyte with LangChain☆29Updated last year
- ☆133Updated last year
- Google TPU optimizations for transformers models☆98Updated 3 weeks ago
- Generative AI Language (PaLM2 + Langchain) Workshop sample code☆72Updated 9 months ago
- Transformer GPU VRAM estimator☆49Updated 10 months ago
- Self-host LLMs with vLLM and BentoML☆86Updated this week
- Benchmarking suite for popular AI APIs☆80Updated last week
- ☆184Updated this week
- ☆132Updated 2 weeks ago
- End-to-End LLM Guide☆101Updated 7 months ago
- ☆197Updated 3 weeks ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆170Updated this week
- Hugging Face Deep Learning Containers (DLCs) for Google Cloud☆141Updated 3 weeks ago
- ☆30Updated last month
- Benchmark suite for LLMs from Fireworks.ai☆66Updated last week
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆35Updated this week
- ☆52Updated 5 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆88Updated this week
- ☆11Updated 2 weeks ago
- ☆84Updated this week
- ☆14Updated last month
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆106Updated 2 months ago
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…☆157Updated last week
- Home for OctoML PyTorch Profiler☆107Updated last year