Large Language Model Hosting Container
☆91Oct 9, 2025Updated 4 months ago
Alternatives and similar repositories for llm-hosting-container
Users that are interested in llm-hosting-container are comparing it to the libraries listed below
Sorting:
- ☆15Oct 24, 2023Updated 2 years ago
- collection of serverless machine learning use cases and examples including Hugging Face transformers, timm, Gradio☆16Dec 16, 2022Updated 3 years ago
- ☆56Jun 26, 2025Updated 8 months ago
- ☆272Apr 23, 2025Updated 10 months ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 5 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated last year
- Repository for training and deploying Generative AI models, including text-text, text-to-image generation and prompt engineering playgrou…☆202Feb 27, 2026Updated last week
- Training and inference on AWS Trainium and Inferentia chips.☆261Updated this week
- ☆14Apr 22, 2024Updated last year
- ☆17Mar 30, 2024Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆39Apr 11, 2024Updated last year
- ☆19Mar 16, 2025Updated 11 months ago
- A universal scalable machine learning model deployment solution☆248Feb 28, 2026Updated last week
- ☆20Apr 7, 2024Updated last year
- CUDA Embedding Lookup Kernel Library☆43Feb 9, 2026Updated 3 weeks ago
- Deployment code for image generative AI and other related image based tasks.☆22May 15, 2023Updated 2 years ago
- Tune MPTs☆84Jun 17, 2023Updated 2 years ago
- CrewAI AgentOps: Monitor your AI Agents☆19Jun 29, 2024Updated last year
- ☆19Jan 11, 2024Updated 2 years ago
- An introduction to global assessment techniques using Python☆12Apr 24, 2023Updated 2 years ago
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment☆31Jul 14, 2023Updated 2 years ago
- A python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities.☆22Apr 7, 2021Updated 4 years ago
- rmp data ranking☆13Nov 4, 2025Updated 4 months ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆36Dec 2, 2025Updated 3 months ago
- Chatbot web-applications with LLM, OpenAI API Assistants, LangChain, vector databases, and other AI stuff☆26Feb 24, 2024Updated 2 years ago
- LM Studio: RAG (Retrieval-Augmented Generation) Local LLM vs GPT-4☆21Jan 16, 2024Updated 2 years ago
- Quickly and securely turn any Linux box into a build and deployment assistant☆25Oct 3, 2024Updated last year
- a python package for loadimg and converting images☆29Feb 18, 2026Updated 2 weeks ago
- ☆24Feb 2, 2026Updated last month
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- CodeSage: Code Representation Learning At Scale (ICLR 2024)☆117Oct 27, 2024Updated last year
- ☆110Jan 16, 2025Updated last year
- The Mattermost AI Framework☆29May 29, 2024Updated last year
- Korean deeplearning swear word(딥러닝 기반 욕설/비속어 판별)☆21Apr 23, 2025Updated 10 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- CloudFormation to setup Kubeflow and Sagemaker Operators on EKS☆25May 30, 2023Updated 2 years ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Oct 19, 2024Updated last year
- Project for Alike Backup, a BDR solution for XenServer, XCP-ng, and Hyper-V virtualization platforms☆12Sep 18, 2024Updated last year