the-ai-merge / production-hub
Hands-on hub for learning techniques to optimize AI models and serve them in production as efficiently as possible.
☆13Updated 4 months ago
Alternatives and similar repositories for production-hub
Users who are interested in production-hub are comparing it to the repositories listed below:
- Fine-tune an LLM to perform batch inference and online serving.☆115Updated 7 months ago
- A collection of hands-on notebooks for LLM practitioners☆51Updated 11 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Updated last year
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆119Updated 9 months ago
- Fine-tune Gemma 3 on an object detection task☆95Updated 5 months ago
- Notebooks to demonstrate TimmWrapper☆16Updated 11 months ago
- Multimodal AI workloads: batch inference, model training and online serving.☆105Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- Notebooks for fine-tuning PaliGemma☆117Updated 8 months ago
- Notebooks and scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated last year
- Table detection with Florence.☆15Updated last year
- 100 Days of GPU Challenge☆24Updated last month
- zero-to-lightning☆31Updated last year
- ☆125Updated last year
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆49Updated last year
- A template to kick-start your Python project ✨🚀☆53Updated 5 months ago
- Build Agentic workflows with function calling using open LLMs☆28Updated this week
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.☆17Updated last year
- This repository will contain the presentation and python jupyter notebooks for my DataHack Summit 2025 conference talk, Building Effectiv…☆74Updated 4 months ago
- Examples of using Evidently to evaluate, test and monitor ML models.☆47Updated 3 weeks ago
- 🤝 Trade any tensors over the network☆30Updated 2 years ago
- ☆20Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 3 months ago
- ☆18Updated last year
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆32Updated 11 months ago
- Structured pruning and bias visualization for Large Language Models. Tools for LLM optimization and fairness analysis.☆26Updated this week
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆14Updated last year
- 💻 Decoding ML articles hub: Hands-on articles with code on production-grade ML☆140Updated 10 months ago
- Chunk your text more accurately using gpt4o-mini☆44Updated last year