google / gencLinks
☆63Updated 2 months ago
Alternatives and similar repositories for genc
Users that are interested in genc are comparing it to the libraries listed below
Sorting:
- Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.☆100Updated this week
- ☆181Updated last year
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆390Updated 5 months ago
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆55Updated this week
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆152Updated this week
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆191Updated 6 months ago
- Generative AI Language (PaLM2 + Langchain) Workshop sample codes☆78Updated last year
- Hugging Face Deep Learning Containers (DLCs) for Google Cloud☆156Updated last week
- ☆146Updated 2 weeks ago
- The NVIDIA AIQToolkit UI streamlines interacting with AIQToolkit workflows in an easy-to-use web application.☆53Updated this week
- Repository of model demos using TT-Buda☆63Updated 7 months ago
- ScalarLM - a unified training and inference stack☆93Updated last week
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…☆209Updated this week
- A specification for OpenInference, a semantic mapping of ML inferences☆47Updated last year
- ☆251Updated last week
- Testing framework for Deep Learning models (Tensorflow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)☆65Updated 5 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆113Updated 5 months ago
- AI on GKE is a collection of examples, best-practices, and prebuilt solutions to help build, deploy, and scale AI Platforms on Google Kub…☆325Updated 5 months ago
- ☆132Updated this week
- ☆473Updated last year
- ☆54Updated 2 weeks ago
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.☆128Updated 3 weeks ago
- ☆30Updated this week
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models☆75Updated 8 months ago
- Architectural Blueprints for RAG Automation: Advanced Document Understanding using Vertex AI Search☆12Updated last year
- Multi-backend recommender systems with Keras 3☆147Updated 3 weeks ago
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆201Updated this week
- Google TPU optimizations for transformers models☆122Updated 10 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆81Updated 9 months ago
- ☆87Updated this week