intentee / llmops-handbookLinks
Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniques, and software engineering practices. (work in progress)
☆77Updated last year
Alternatives and similar repositories for llmops-handbook
Users that are interested in llmops-handbook are comparing it to the libraries listed below
Sorting:
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆81Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆192Updated last year
- ☆134Updated last month
- chrome & firefox extension to chat with webpages: local llms☆130Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Updated last year
- A simple experiment on letting two local LLM have a conversation about anything!☆112Updated last year
- Embed anything.☆27Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Updated 2 months ago
- A Lightweight Library for AI Observability☆253Updated 10 months ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆103Updated last year
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆101Updated 4 months ago
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆108Updated last year
- Function Calling Benchmark & Testing☆92Updated last year
- Train an adapter for any embedding model in under a minute☆130Updated 9 months ago
- ☆107Updated 2 months ago
- A fast batching API to serve LLM models☆188Updated last year
- RAG example using DSPy, Gradio, FastAPI☆90Updated last year
- AI management tool☆119Updated last year
- Simple examples using Argilla tools to build AI☆57Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆47Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆106Updated last year
- One click templates for inferencing Language Models☆224Updated last month
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆67Updated last year
- ☆206Updated last year
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆93Updated last year
- Python package wrapping llama.cpp for on-device LLM inference☆97Updated 3 months ago
- Use smol agents to do research and then update csv coumns with its findings.☆41Updated 11 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆183Updated last year
- This project implements a demonstrator agent that compares the Cache-Augmented Generation (CAG) Framework with traditional Retrieval-Augm…☆31Updated last year
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆61Updated 11 months ago