nicholasyager / llama-cpp-guidanceLinks
A guidance compatibility layer for llama-cpp-python
☆36Updated 2 years ago
Alternatives and similar repositories for llama-cpp-guidance
Users that are interested in llama-cpp-guidance are comparing it to the libraries listed below
Sorting:
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆47Updated last year
- The one who calls upon functions - Function-Calling Language Model☆36Updated 2 years ago
- Complex RAG backend☆29Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆91Updated 5 months ago
- Plug n Play GBNF Compiler for llama.cpp☆28Updated 2 years ago
- ☆68Updated last year
- GPT-2 small trained on phi-like data☆68Updated last year
- Let's create synthetic textbooks together :)☆76Updated 2 years ago
- ☆119Updated last year
- Client-side toolkit for using large language models, including where self-hosted☆115Updated last week
- ☆38Updated last year
- One Repo To Quickly Build One Docker File for HuggingChat Front and BackEnd☆26Updated 2 years ago
- large language model for mastering data analysis using pandas☆48Updated 2 years ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated 2 years ago
- A simple experiment on letting two local LLM have a conversation about anything!☆112Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆76Updated 2 years ago
- Easily view and modify JSON datasets for large language models☆87Updated 8 months ago
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.☆45Updated 2 years ago
- Embed anything.☆27Updated last year
- ☆32Updated 2 years ago
- A fast batching API to serve LLM models☆189Updated last year
- ☆40Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆44Updated last year
- Code action agent with local execution sandbox and first-class support for programmatic tool calling☆121Updated this week
- run ollama & gguf easily with a single command☆52Updated last year
- ☆74Updated 2 years ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆146Updated 2 years ago
- Simple Graph Memory for AI applications☆90Updated 8 months ago
- An OpenAI-like LLaMA inference API☆113Updated 2 years ago
- Experimental LLM Inference UX to aid in creative writing☆128Updated last year