distantmagic / llmops-handbookLinks

Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniques, and software engineering practices. (work in progress)

☆68

Alternatives and similar repositories for llmops-handbook

Users that are interested in llmops-handbook are comparing it to the libraries listed below

Sorting:

abgulati / hf-waitress
Serving LLMs in the HF-Transformers format via a PyFlask API
☆71Updated 10 months ago
itsPreto / VECTR8
Embed anything.
☆28Updated last year
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 8 months ago
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆72Updated 8 months ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆183Updated last year
Fus3n / TwoAI
A simple experiment on letting two local LLM have a conversation about anything!
☆110Updated last year
remichu-ai / gallama
☆131Updated 2 months ago
severian42 / Vodalus-Expert-LLM-Forge
Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …
☆185Updated 11 months ago
SomeOddCodeGuy / OfflineWikipediaTextApi
This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …
☆97Updated 3 months ago
mgerstgrasser / tacheles
a lightweight, open-source blueprint for building powerful and scalable LLM chat applications
☆28Updated last year
ddh0 / easy-llama
Python package wrapping llama.cpp for on-device LLM inference
☆75Updated this week
michaelfeil / embed
A stable, fast and easy-to-use inference library with a focus on a sync-to-async API
☆45Updated 9 months ago
abhishekkrthakur / chat-ext
chrome & firefox extension to chat with webpages: local llms
☆119Updated 6 months ago
ComposioHQ / Composio-Function-Calling-Benchmark
Function Calling Benchmark & Testing
☆86Updated last year
severian42 / MoA-Ollama-Chat
This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…
☆117Updated last year
Itachi-Uchiha581 / Auto-Data
Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).
☆102Updated 8 months ago
LostRuins / datasetexplorer
Easily view and modify JSON datasets for large language models
☆77Updated last month
monk1337 / auto-ollama
run ollama & gguf easily with a single command
☆52Updated last year
Rivridis / LLM-Assistant
Locally running LLM with internet access
☆95Updated last week
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53Updated 7 months ago
octopus2023-inc / gensphere
Declarative framework to build LLM-based applications
☆120Updated 8 months ago
OoriData / OgbujiPT
Client-side toolkit for using large language models, including where self-hosted
☆111Updated 7 months ago
Forest-Person / smolResearcher
Use smol agents to do research and then update csv coumns with its findings.
☆41Updated 5 months ago
nath1295 / LLMFlex
A python package for developing AI applications with local LLMs.
☆150Updated 6 months ago
tolitius / towel
"a towel is about the most massively useful thing an interstellar AI hitchhiker can have"
☆48Updated 9 months ago
mzbac / mlx-llm-server
For inferring and serving local LLMs using the MLX framework
☆104Updated last year
kolenaIO / autoarena
Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation
☆105Updated 6 months ago
cfahlgren1 / observers
A Lightweight Library for AI Observability
☆246Updated 4 months ago
Aesthisia / LLMinator
Gradio based tool to run opensource LLM models directly from Huggingface
☆93Updated last year
cognitivecomputations / kraken
☆66Updated last year