DevSecOpsSamples / eks-gpu-autoscalingLinks
GPU Auto Scaling based on Prometheus custom metric on EKS
☆15Updated 2 years ago
Alternatives and similar repositories for eks-gpu-autoscaling
Users that are interested in eks-gpu-autoscaling are comparing it to the libraries listed below
Sorting:
- Dockerfile and web server for running GPT-J-6B on AWS GPU instances☆18Updated 3 years ago
- ☆13Updated last year
- ☆28Updated last year
- Fast model deployment on AWS EC2☆14Updated last year
- Run code-llama with 50k tokens using flash attention and better transformer☆12Updated last year
- ☆49Updated last year
- ☆20Updated last month
- BIG: Back In the Game of Creative AI☆27Updated 2 years ago
- Fast model deployment on AWS Lambda☆14Updated last year
- An example of how to create finetuning datasets and work with other models than Alpaca☆10Updated 2 years ago
- ☆22Updated last year
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline☆37Updated last year
- Data extraction from documents with ML (research and experimental code repo)☆16Updated 2 years ago
- Experimenting text-embeddings-inference server on both CPU and GPU☆18Updated last year
- A pipeline to convert contextual knowledge stored in documents and databases into text embeddings, and store them in a vector store☆16Updated last month
- CI/CD pipeline with Amazon SageMaker and Github actions☆23Updated 2 years ago
- ☆16Updated 11 months ago
- A python package that provides a custom streamlit connection to query data from weaviate, the AI native vector database☆55Updated 10 months ago
- Deploy and Scale LLM-based applications☆26Updated last year
- This repository contains an example of how to use the Weaviate vector search engine's text2vec-openai module☆29Updated 2 years ago
- This repository aims to showcase how to finetune a FM model in Amazon EKS cluster using, JupyterHub to provision notebooks and craft both…☆45Updated 3 weeks ago
- AWS SageMaker, SeldonCore, KServe, Kubeflow & MLflow, VectorDB☆33Updated last year
- Amazon ECS Auto Scaling for GPU-based Machine Learning Workloads☆17Updated last year
- ☆44Updated last year
- This repository contains the source code for running llamaindex tutorials from https://howaibuildthis.substack.com/☆40Updated last year
- This repository contains samples for fine-tuning embedding models using Amazon SageMaker. Embedding models are useful for tasks such as s…☆12Updated 3 months ago
- This repository is part of a blog post that guides users through creating a visual search application using Amazon SageMaker and Amazon E…☆11Updated last year
- Apps that run on modal.com☆12Updated last year
- 💙 Unstructured Data Connectors for Haystack 2.0☆16Updated last year
- ☆34Updated 2 years ago