awslabs / llm-hosting-containerView external linksLinks
Large Language Model Hosting Container
☆91Oct 9, 2025Updated 4 months ago
Alternatives and similar repositories for llm-hosting-container
Users that are interested in llm-hosting-container are comparing it to the libraries listed below
Sorting:
- This guide helps you create an automated Earth Observation pipeline on AWS.☆23Oct 24, 2024Updated last year
- collection of serverless machine learning use cases and examples including Hugging Face transformers, timm, Gradio☆16Dec 16, 2022Updated 3 years ago
- HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification☆15Feb 15, 2023Updated 3 years ago
- ☆271Apr 23, 2025Updated 9 months ago
- ☆32Jul 5, 2024Updated last year
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated last year
- Orchestrate Modal and OpenAI workloads with Dagster☆13Dec 11, 2024Updated last year
- Training and inference on AWS Trainium and Inferentia chips.☆259Updated this week
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated 10 months ago
- ☆14Apr 22, 2024Updated last year
- ☆17Mar 30, 2024Updated last year
- AI Starter Kit for AI applications in Drone technology using Intel® Optimized Tensorflow*☆18May 8, 2024Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆39Apr 11, 2024Updated last year
- Code for "Merging Text Transformers from Different Initializations"☆20Feb 2, 2025Updated last year
- ☆19Mar 16, 2025Updated 10 months ago
- ☆20Apr 7, 2024Updated last year
- Deployment code for image generative AI and other related image based tasks.☆22May 15, 2023Updated 2 years ago
- paddle code convert toolkit☆23Mar 19, 2023Updated 2 years ago
- A tool for extracting files from Apple OTA updates as a tarball.☆17Sep 13, 2017Updated 8 years ago
- CrewAI AgentOps: Monitor your AI Agents☆19Jun 29, 2024Updated last year
- Example code using the DSPy framework.☆20May 30, 2024Updated last year
- ☆22Aug 18, 2023Updated 2 years ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆33Dec 2, 2025Updated 2 months ago
- An introduction to global assessment techniques using Python☆12Apr 24, 2023Updated 2 years ago
- A python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities.☆22Apr 7, 2021Updated 4 years ago
- Metaflow flows for analyzing topics and sentiments in Hacker News☆22Aug 13, 2024Updated last year
- ☆19Jan 11, 2024Updated 2 years ago
- ☆22Oct 18, 2023Updated 2 years ago
- Slides and notebook for the workshop on building a search system☆23Mar 17, 2024Updated last year
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- Learning PyTorch through the D2L book. A series of notebooks for the same☆28Jun 30, 2022Updated 3 years ago
- Spring AI support for latest watsonx.ai services☆22Updated this week
- CodeSage: Code Representation Learning At Scale (ICLR 2024)☆116Oct 27, 2024Updated last year
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- TypeSan checks casts in C++ code - code released for CCS 2016☆36May 5, 2021Updated 4 years ago
- Implementation of MambaByte in "MambaByte: Token-free Selective State Space Model" in Pytorch and Zeta☆125Feb 6, 2026Updated last week
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Oct 19, 2024Updated last year