distantmagic / llmops-handbook
Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniques, and software engineering practices. (work in progress)
☆59Updated 6 months ago
Alternatives and similar repositories for llmops-handbook:
Users that are interested in llmops-handbook are comparing it to the libraries listed below
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆64Updated 4 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆62Updated 4 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆26Updated last month
- ☆124Updated last month
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 5 months ago
- A fast batching API to serve LLM models☆180Updated 10 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆176Updated 7 months ago
- ☆78Updated 2 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆30Updated 8 months ago
- Stateful control of Large Language Models☆112Updated this week
- Serving LLMs in the HF-Transformers format via a PyFlask API☆70Updated 5 months ago
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip…☆58Updated 7 months ago
- A simple experiment on letting two local LLM have a conversation about anything!☆104Updated 8 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆223Updated 10 months ago
- Easily view and modify JSON datasets for large language models☆71Updated last week
- Text generation in Python, as easy as possible☆54Updated this week
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆37Updated last year
- Simple examples using Argilla tools to build AI☆53Updated 3 months ago
- Embed anything.☆29Updated 9 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆77Updated last month
- AI management tool☆113Updated 3 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆50Updated last month
- freeact is a lightweight library for code-action based agents☆69Updated this week
- idea: https://github.com/nyxkrage/ebook-groupchat/☆86Updated 6 months ago
- Declarative framework to build LLM-based applications☆115Updated 3 months ago
- ☆39Updated last year
- Locally running LLM with internet access☆94Updated 4 months ago
- Generate python documentation using LLMs☆62Updated 8 months ago
- run ollama & gguf easily with a single command☆49Updated 9 months ago
- Extract structured data from local or remote LLM models☆40Updated 8 months ago