Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniques, and software engineering practices. (work in progress)
☆80Aug 16, 2024Updated last year
Alternatives and similar repositories for llmops-handbook
Users that are interested in llmops-handbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45May 16, 2024Updated last year
- In this course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust…☆14Feb 13, 2024Updated 2 years ago
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 7 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Jun 7, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆26May 31, 2024Updated last year
- ☆20Jan 24, 2024Updated 2 years ago
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Oct 9, 2024Updated last year
- ☆27Apr 7, 2025Updated last year
- Run Ollama LLM models in Google Colab for free☆38Nov 24, 2024Updated last year
- AI management tool☆121Nov 9, 2024Updated last year
- An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intellig…☆58Aug 1, 2024Updated last year
- Crow is a Desktop AI Assistant☆32Aug 9, 2024Updated last year
- Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…☆19Sep 5, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆32Mar 26, 2025Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated 3 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Jun 25, 2024Updated last year
- an AI interaction tool with RAG hybrid search, conversation context, web content processing and structured data analysis with LLM / GPT☆212Jun 17, 2025Updated 9 months ago
- Multi-Agent LLM System for Digital Scam Protection☆13Dec 19, 2024Updated last year
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- ☆21Oct 6, 2023Updated 2 years ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Jun 12, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- An interface that features barely zero external dependencies beyond the Ollama API itself, making it lightweight and portable to easily i…☆12Mar 25, 2025Updated last year
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- ☆15Feb 1, 2025Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆267Mar 6, 2025Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆34Aug 14, 2024Updated last year
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- LiteLLM model integration for Pydantic AI framework - access 100+ LLM providers through a unified interface☆21Nov 19, 2025Updated 4 months ago
- Tag manager and captioner for image datasets☆20Aug 27, 2024Updated last year
- ☆24Feb 2, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- An AI chatbot to help you heal, grow, and awaken with IFS☆15Aug 12, 2024Updated last year
- Forking for the purpose of continuing its development☆22Apr 4, 2024Updated 2 years ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆169May 16, 2024Updated last year
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- Never forget the resource that helps to close that sales call! Power a real-time speech-to-text agent with retrieval augmented generation…☆14Jan 23, 2024Updated 2 years ago
- ☆83Feb 28, 2025Updated last year