distantmagic / llmops-handbook
Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniques, and software engineering practices. (work in progress)
☆62 Updated 7 months ago
Alternatives and similar repositories for llmops-handbook:
Users who are interested in llmops-handbook are comparing it to the libraries listed below.
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆71 Updated 6 months ago
- A fast batching API to serve LLM models ☆183 Updated 11 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full … ☆90 Updated 2 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆67 Updated 5 months ago
- ☆66 Updated 10 months ago
- Embed anything. ☆29 Updated 10 months ago
- Let's create synthetic textbooks together :) ☆74 Updated last year
- Easily view and modify JSON datasets for large language models ☆72 Updated last month
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications ☆28 Updated 9 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation … ☆178 Updated 8 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆65 Updated 5 months ago
- A simple experiment on letting two local LLM have a conversation about anything! ☆107 Updated 9 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆45 Updated 6 months ago
- ☆128 Updated last week
- Client-side toolkit for using large language models, including where self-hosted ☆107 Updated 4 months ago
- ☆28 Updated 6 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface ☆91 Updated 9 months ago
- run ollama & gguf easily with a single command ☆50 Updated 10 months ago
- Generate python documentation using LLMs ☆63 Updated 9 months ago
- MockLLM, when you want it to do what you tell it to do! ☆46 Updated this week
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro… ☆64 Updated 5 months ago
- A Lightweight Library for AI Observability ☆239 Updated last month
- Low-Rank adapter extraction for fine-tuned transformers models ☆171 Updated 11 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆46 Updated 6 months ago
- Experimental LLM Inference UX to aid in creative writing ☆114 Updated 3 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text ☆65 Updated 5 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no… ☆117 Updated 5 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆55 Updated last month
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs ☆71 Updated 6 months ago
- A frontend for creative writing with LLMs ☆122 Updated 8 months ago