sofianhamiti / amazon-ecs-nvidia-triton-cdk
Step-by-step guide to Triton deployment on ECS using CDK
☆11Updated 3 years ago
Related projects: ⓘ
- Cortex-compatible model server for Python and TensorFlow☆16Updated last year
- A file utility for accessing both local and remote files through a unified interface.☆36Updated last month
- ☆21Updated 5 months ago
- Example code for deploying GPU workloads on ECS☆19Updated 5 years ago
- ☆46Updated 6 months ago
- Plugin for https://llm.datasette.io/en/stable/ to enable talking with Claude Instant and ClaudeV2 models on AWS Bedrock☆30Updated 2 weeks ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆26Updated last year
- Use LLM to help you with PR review☆67Updated last year
- ☆47Updated 3 weeks ago
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.☆193Updated this week
- ☆94Updated this week
- Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow☆55Updated 10 months ago
- A library for squeakily cleaning and filtering language datasets.☆45Updated last year
- Creating Generative AI Apps which work☆16Updated 2 months ago
- Fine-tune Mistral 7B to generate fashion style suggestions☆30Updated 8 months ago
- Large Language Model Hosting Container☆75Updated 2 weeks ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆82Updated 6 months ago
- Fast model deployment on AWS Lambda☆14Updated 6 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated 8 months ago
- ☆14Updated 5 months ago
- ☆57Updated 2 years ago
- Vector Database with support for late interaction and token level embeddings.☆51Updated last week
- ☆23Updated 6 months ago
- ☆20Updated 10 months ago
- gpu tester detects broken and slow gpus in a cluster☆63Updated last year
- ☆56Updated this week
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆27Updated 3 weeks ago
- collection of serverless machine learning use cases and examples including Hugging Face transformers, timm, Gradio☆14Updated last year
- Fast model deployment on AWS EC2☆14Updated 6 months ago
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆13Updated 8 months ago