google / genc
☆63Updated 6 months ago
Alternatives and similar repositories for genc
Users that are interested in genc are comparing it to the libraries listed below
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆67Updated this week
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆335Updated this week
- Generative AI Language (PaLM2 + Langchain) Workshop sample codes☆76Updated last year
- Hugging Face Deep Learning Containers (DLCs) for Google Cloud☆145Updated last month
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆165Updated last month
- ☆148Updated last year
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆64Updated 2 weeks ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆153Updated 3 weeks ago
- ☆138Updated 2 weeks ago
- ☆149Updated 2 weeks ago
- Transformer GPU VRAM estimator☆64Updated last year
- Machine learning for machine code.☆90Updated last week
- xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆123Updated this week
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆46Updated this week
- ☆30Updated this week
- ☆16Updated this week
- ☆29Updated 3 weeks ago
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference☆60Updated 2 months ago
- ☆33Updated 5 months ago
- Benchmarking suite for popular AI APIs☆86Updated 4 months ago
- 🦅🔗 Building FlyteGPT on Flyte with LangChain☆29Updated last year
- Cray-LM unified training and inference stack.☆22Updated 4 months ago
- ☆226Updated 3 months ago
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆109Updated 3 weeks ago
- General policies for MLPerf™ including submission rules, coding standards, etc.☆28Updated this week
- ☆55Updated last month
- ☆80Updated 9 months ago
- Self-host LLMs with vLLM and BentoML☆114Updated last week
- ☆123Updated 7 months ago
- Architectural Blueprints for RAG Automation: Advanced Document Understanding using Vertex AI Search☆12Updated last year