awslabs / llm-hosting-containerView external linksLinks
Large Language Model Hosting Container
☆91Oct 9, 2025Updated 4 months ago
Alternatives and similar repositories for llm-hosting-container
Users that are interested in llm-hosting-container are comparing it to the libraries listed below
Sorting:
- ☆15Oct 24, 2023Updated 2 years ago
- This guide helps you create an automated Earth Observation pipeline on AWS.☆23Oct 24, 2024Updated last year
- ☆56Jun 26, 2025Updated 7 months ago
- HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification☆15Feb 15, 2023Updated 2 years ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 5 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated last year
- Orchestrate Modal and OpenAI workloads with Dagster☆13Dec 11, 2024Updated last year
- Training and inference on AWS Trainium and Inferentia chips.☆259Feb 6, 2026Updated last week
- ☆14Apr 22, 2024Updated last year
- ☆17Mar 30, 2024Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆39Apr 11, 2024Updated last year
- CUDA Embedding Lookup Kernel Library☆42Updated this week
- A tool for extracting files from Apple OTA updates as a tarball.☆17Sep 13, 2017Updated 8 years ago
- ☆82Apr 16, 2024Updated last year
- Multimodal Chat with Gemini API☆47Dec 25, 2023Updated 2 years ago
- CrewAI AgentOps: Monitor your AI Agents☆19Jun 29, 2024Updated last year
- Example code using the DSPy framework.☆20May 30, 2024Updated last year
- ☆22Feb 2, 2026Updated last week
- ☆22Aug 18, 2023Updated 2 years ago
- ☆19Jan 11, 2024Updated 2 years ago
- LM Studio: RAG (Retrieval-Augmented Generation) Local LLM vs GPT-4☆21Jan 16, 2024Updated 2 years ago
- Build an LLM powered Ask the Data App with LangChain (using the Pandas DataFrame Agent) and Streamlit☆28Nov 14, 2023Updated 2 years ago
- Quickly and securely turn any Linux box into a build and deployment assistant☆25Oct 3, 2024Updated last year
- a python package for loadimg and converting images☆29Jul 22, 2025Updated 6 months ago
- Implementation of the Mamba SSM with hf_integration.☆55Aug 31, 2024Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users☆246Nov 3, 2024Updated last year
- Ingest PDFs into Weaviate☆33Jun 14, 2024Updated last year
- Spring AI support for latest watsonx.ai services☆22Updated this week
- ☆25Aug 10, 2018Updated 7 years ago
- ☆111Jan 16, 2025Updated last year
- Store and serve language model prompts☆29Jul 26, 2023Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- TypeSan checks casts in C++ code - code released for CCS 2016☆36May 5, 2021Updated 4 years ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆125Feb 6, 2026Updated last week
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Oct 19, 2024Updated last year
- An experiment to see if we can process G2 reviews to extract topics from reviews☆10Feb 5, 2024Updated 2 years ago
- Project for Alike Backup, a BDR solution for XenServer, XCP-ng, and Hyper-V virtualization platforms☆12Sep 18, 2024Updated last year
- Rice Crop Yield Estimation Using Satellite Data - EY Open Science Data Challenge 2023☆10Jul 25, 2023Updated 2 years ago
- Super Mario is a legendary game we all cherish! In this project, we will deploy Super Mario on Amazon EKS (Elastic Kubernetes Service) us…☆11Feb 3, 2026Updated last week