distantmagic / llmops-handbook
Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniques, and software engineering practices. (work in progress)
☆58 Updated 5 months ago
Alternatives and similar repositories for llmops-handbook:
Users who are interested in llmops-handbook are comparing it to the repositories listed below.
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆69 Updated 4 months ago
- Text generation in Python, as easy as possible ☆51 Updated this week
- ☆74 Updated last month
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆44 Updated 4 months ago
- run ollama & gguf easily with a single command ☆49 Updated 8 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆56 Updated 3 months ago
- Embed anything. ☆28 Updated 8 months ago
- A simple experiment on letting two local LLM have a conversation about anything! ☆101 Updated 6 months ago
- ☆121 Updated last week
- A fast batching API to serve LLM models ☆180 Updated 9 months ago
- ☆65 Updated 8 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full … ☆75 Updated this week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆63 Updated 2 months ago
- Complex RAG backend ☆28 Updated 10 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆37 Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation … ☆168 Updated 6 months ago
- An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intellig… ☆48 Updated 5 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 Updated 9 months ago
- Easily view and modify JSON datasets for large language models ☆69 Updated 3 months ago
- Make Llama 3.1 8B talk in Rick Sanchez’s style ☆33 Updated last week
- A discovery and compression tool for your Python codebase. Creates a knowledge graph for a LLM context window, efficiently outlining your… ☆71 Updated last month
- A python package for developing AI applications with local LLMs. ☆144 Updated 3 weeks ago
- ☆39 Updated 11 months ago
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have" ☆48 Updated 3 months ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆232 Updated 8 months ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆167 Updated 8 months ago
- Experimental LLM Inference UX to aid in creative writing ☆111 Updated last month
- ☆109 Updated last month
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications ☆30 Updated 7 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs ☆67 Updated 4 months ago