DevSecOpsSamples / eks-gpu-autoscalingLinks

GPU Auto Scaling based on Prometheus custom metric on EKS

☆15

Alternatives and similar repositories for eks-gpu-autoscaling

Users that are interested in eks-gpu-autoscaling are comparing it to the libraries listed below

Sorting:

spullara / gpt-j-6b
Dockerfile and web server for running GPT-J-6B on AWS GPU instances
☆18Updated 3 years ago
aws-samples / best-practices-for-fastapi-on-inferentia
☆13Updated last year
philschmid / amazon-sagemaker-gpt-j-sample
☆28Updated last year
bentoml / aws-ec2-deploy
Fast model deployment on AWS EC2
☆14Updated last year
TrelisResearch / code-llama-32k
Run code-llama with 50k tokens using flash attention and better transformer
☆12Updated last year
philschmid / terraform-aws-sagemaker-huggingface
☆49Updated last year
aws-samples / Mistral-7B-Instruct-fine-tune-and-deploy-on-SageMaker
☆20Updated last month
jina-ai / big_creative_ai
BIG: Back In the Game of Creative AI
☆27Updated 2 years ago
bentoml / aws-lambda-deploy
Fast model deployment on AWS Lambda
☆14Updated last year
Aemon-Algiz / LoRA-Finetuning-Example
An example of how to create finetuning datasets and work with other models than Alpaca
☆10Updated 2 years ago
aws-samples / sagemaker-vector-store-microservice
☆22Updated last year
wenqiglantz / nemo-guardrails-llamaindex-rag
Adding NeMo Guardrails to a LlamaIndex RAG pipeline
☆37Updated last year
katanaml / sparrow-research
Data extraction from documents with ML (research and experimental code repo)
☆16Updated 2 years ago
wenqiglantz / text-embedding-inference-server-edd
Experimenting text-embeddings-inference server on both CPU and GPU
☆18Updated last year
aws-samples / text-embeddings-pipeline-for-rag
A pipeline to convert contextual knowledge stored in documents and databases into text embeddings, and store them in a vector store
☆16Updated last month
haythemtellili / amazon-sagemaker-cicd
CI/CD pipeline with Amazon SageMaker and Github actions
☆23Updated 2 years ago
aws-samples / genai-llm-cpu-sagemaker
☆16Updated 11 months ago
weaviate / st-weaviate-connection
A python package that provides a custom streamlit connection to query data from weaviate, the AI native vector database
☆55Updated 10 months ago
ray-project / llms-in-prod-workshop-2023
Deploy and Scale LLM-based applications
☆26Updated last year
weaviate / DEMO-text2vec-openai
This repository contains an example of how to use the Weaviate vector search engine's text2vec-openai module
☆29Updated 2 years ago
aws-samples / gen-ai-on-eks
This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…
☆45Updated 3 weeks ago
vinayak-shanawad / AI-ML-Projects
AWS SageMaker, SeldonCore, KServe, Kubeflow & MLflow, VectorDB
☆33Updated last year
aws-samples / ecs-gpu-scaling
Amazon ECS Auto Scaling for GPU-based Machine Learning Workloads
☆17Updated last year
aws-samples / sagemaker-hosting
☆44Updated last year
nguyenkien1402 / llamaindex-practices
This repository contains the source code for running llamaindex tutorials from https://howaibuildthis.substack.com/
☆40Updated last year
aws-samples / fine-tune-embedding-models-on-sagemaker
This repository contains samples for fine-tuning embedding models using Amazon SageMaker. Embedding models are useful for tasks such as s…
☆12Updated 3 months ago
aws-samples / amazon-sagemaker-visual-search
This repository is part of a blog post that guides users through creating a visual search application using Amazon SageMaker and Amazon E…
☆11Updated last year
nateraw / modal-examples
Apps that run on modal.com
☆12Updated last year
TuanaCelik / unstructuredio-haystack
💙 Unstructured Data Connectors for Haystack 2.0
☆16Updated last year
aws-samples / sagemaker-mlops-with-terraform
☆34Updated 2 years ago