distantmagic / llmops-handbook
Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniques, and software engineering practices. (work in progress)
☆68Updated 10 months ago
Alternatives and similar repositories for llmops-handbook
Users who are interested in llmops-handbook are comparing it to the libraries listed below:
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 10 months ago
- Embed anything.☆28Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 8 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆72Updated 8 months ago
- A fast batching API to serve LLM models☆183Updated last year
- A simple experiment on letting two local LLMs have a conversation about anything!☆110Updated last year
- ☆131Updated 2 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆185Updated 11 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆97Updated 3 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Updated last year
- Python package wrapping llama.cpp for on-device LLM inference☆75Updated this week
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 9 months ago
- chrome & firefox extension to chat with webpages: local llms☆119Updated 6 months ago
- Function Calling Benchmark & Testing☆86Updated last year
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆117Updated last year
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆102Updated 8 months ago
- Easily view and modify JSON datasets for large language models☆77Updated last month
- run ollama & gguf easily with a single command☆52Updated last year
- Locally running LLM with internet access☆95Updated last week
- Simple examples using Argilla tools to build AI☆53Updated 7 months ago
- Declarative framework to build LLM-based applications☆120Updated 8 months ago
- Client-side toolkit for using large language models, including self-hosted ones☆111Updated 7 months ago
- Use smol agents to do research and then update CSV columns with their findings.☆41Updated 5 months ago
- A python package for developing AI applications with local LLMs.☆150Updated 6 months ago
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Updated 9 months ago
- For inferring and serving local LLMs using the MLX framework☆104Updated last year
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆105Updated 6 months ago
- A Lightweight Library for AI Observability☆246Updated 4 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆93Updated last year
- ☆66Updated last year