philschmid / terraform-aws-sagemaker-huggingface
☆46 Updated 6 months ago
Related projects:
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment ☆31 Updated last year
- ☆41 Updated 4 months ago
- ☆56 Updated this week
- ☆57 Updated 2 years ago
- Large Language Model Hosting Container ☆75 Updated 2 weeks ago
- ☆62 Updated 2 months ago
- A Python package that provides a custom Streamlit connection to query data from Weaviate, the AI-native vector database ☆49 Updated last month
- ☆30 Updated 7 months ago
- ☆21 Updated 5 months ago
- ☆32 Updated last year
- ☆41 Updated 6 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆113 Updated 7 months ago
- Fast model deployment on AWS Lambda ☆14 Updated 6 months ago
- Deploy llama.cpp compatible Generative AI LLMs on AWS Lambda! ☆165 Updated 5 months ago
- Zero administration inference with AWS Lambda for 🤗 ☆62 Updated 2 years ago
- ☆17 Updated 10 months ago
- ☆34 Updated 2 years ago
- A PaaS End-to-End ML Setup with Metaflow, Serverless and SageMaker. ☆36 Updated 3 years ago
- Retrieval Augmented Generation applications ☆27 Updated 11 months ago
- Plugin for https://llm.datasette.io/en/stable/ to enable talking with Claude Instant and ClaudeV2 models on AWS Bedrock ☆30 Updated last week
- Generative AI on AWS Immersion Day ☆49 Updated 5 months ago
- A set of Docker images that include popular frameworks for machine learning, data science and visualization. ☆89 Updated this week
- A pipeline to convert contextual knowledge stored in documents and databases into text embeddings, and store them in a vector store ☆14 Updated 9 months ago
- Use LLMs for building real-world apps ☆107 Updated 6 months ago
- A reference implementation of an end-to-end, open-source MLOps platform. ☆13 Updated last year
- BigBertha is an architecture design that demonstrates how automated LLMOps (Large Language Models Operations) can be achieved on any Kube… ☆26 Updated 10 months ago
- ☆86 Updated last year
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips. ☆193 Updated this week
- MLOps End-to-End Example using Amazon SageMaker Pipeline, AWS CodePipeline and AWS CDK ☆112 Updated this week
- AWS SageMaker, SeldonCore, KServe, Kubeflow & MLflow, VectorDB ☆26 Updated 5 months ago