multi-modal-ai / production-hubLinks

Hands-on hub to learn techniques to optimize and serve AI models to production the most optimal way.

☆13

Alternatives and similar repositories for production-hub

Users that are interested in production-hub are comparing it to the libraries listed below

Sorting:

mrmaheshrajput / productionizing-llms
Code Repository for Blog - How to Productionize Large Language Models (LLMs)
☆12Updated last year
ariG23498 / quantized-diffusion-inference
Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs
☆38Updated last year
ashishpatel26 / ai-tutor-rag-system
This is a repository for the course "From Beginner to LLM Developer" by Towards AI.
☆12Updated 11 months ago
anyscale / e2e-llm-workflows
Fine-tune an LLM to perform batch inference and online serving.
☆114Updated 6 months ago
wandb / eval-course
☆26Updated 3 months ago
ariG23498 / timm-wrapper-examples
Notebooks to demonstrate TimmWrapper
☆16Updated 10 months ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆51Updated last year
AntonioGr7 / pratical-llms
A collection of hand on notebook for LLMs practitioner
☆51Updated 10 months ago
ThinamXx / Meta-llama
Complete implementation of Llama2 with/without KV cache & inference 🚀
☆48Updated last year
Logisx / AI-Senior
🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.
☆17Updated last year
evidentlyai / community-examples
Examples of using Evidently to evaluate, test and monitor ML models.
☆43Updated 2 weeks ago
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆32Updated 2 months ago
ariG23498 / fine-tune-paligemma
Notebooks for fine tuning pali gemma
☆117Updated 7 months ago
GURPREETKAURJETHRA / RAG-Based-LLM-Chatbot
RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…
☆14Updated 11 months ago
GokuMohandas / testing-ml
Learn how to create reliable ML systems by testing code, data and models.
☆89Updated 3 years ago
Sayandip170900 / CUDA-Challenge
100 Days of GPU Challenge
☆24Updated 3 weeks ago
NielsRogge / awesome-huggingface
Repository containing awesome resources regarding Hugging Face tooling.
☆48Updated last year
ngtranminhtuan / LLMOPS
NLP/LLM Mlops Pipeline to dev/train/evaluation, scalable deploy and monitoring systems.
☆22Updated last year
davanstrien / data-for-fine-tuning-llms
☆80Updated last year
enguard-ai / awesome-ai-guardrails
A curated list of materials on AI guardrails
☆43Updated 6 months ago
ishandutta0098 / zero-to-lightning
zero-to-lightning
☆31Updated last year
aniketmaurya / python-project-template
A template to kick-start your Python project ✨🚀
☆53Updated 4 months ago
jjovalle99 / agentic-design-patterns
☆14Updated last year
CVxTz / llm-serve-tutorial
☆20Updated last year
chainyo / tensorshare
🤝 Trade any tensors over the network
☆30Updated 2 years ago
huggingface / competitions
☆124Updated last year
Paulescu / testing-llms-in-the-real-world
Test LLMs automatically with Giskard and CI/CD
☆31Updated last year
rasbt / RAGs
RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems
☆140Updated 10 months ago
ariG23498 / gemma3-object-detection
Fine tune Gemma 3 on an object detection task
☆91Updated 4 months ago
anyscale / multimodal-ai
Multimodal AI workloads: batch inference, model training and online serving.
☆103Updated 3 months ago