distantmagic / llmops-handbook
Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniques, and software engineering practices. (work in progress)
☆64Updated 8 months ago
Alternatives and similar repositories for llmops-handbook:
Users that are interested in llmops-handbook are comparing it to the libraries listed below
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 5 months ago
- Embed anything.☆29Updated 11 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 7 months ago
- ☆130Updated last week
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆67Updated 5 months ago
- Generate python documentation using LLMs☆66Updated 9 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆90Updated 3 weeks ago
- ☆39Updated last year
- run ollama & gguf easily with a single command☆50Updated 11 months ago
- Simple examples using Argilla tools to build AI☆52Updated 5 months ago
- A fast batching API to serve LLM models☆182Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 7 months ago
- Text generation in Python, as easy as possible☆58Updated this week
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Updated 10 months ago
- ☆66Updated 11 months ago
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Updated 6 months ago
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆124Updated last week
- Client-side toolkit for using large language models, including where self-hosted☆109Updated 5 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆110Updated 9 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆90Updated 10 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆60Updated 8 months ago
- ☆28Updated 6 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆179Updated 9 months ago
- ☆85Updated 4 months ago
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆31Updated 9 months ago
- Function Calling Benchmark & Testing☆87Updated 9 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- ☆17Updated 4 months ago
- Who needs o1 anyways. Add CoT to any OpenAI compatible endpoint.☆41Updated 7 months ago
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆103Updated 4 months ago