nicholasyager / llama-cpp-guidance
A guidance compatibility layer for llama-cpp-python
☆34Updated last year
Alternatives and similar repositories for llama-cpp-guidance:
Users that are interested in llama-cpp-guidance are comparing it to the libraries listed below
- ☆38Updated last year
- Plug n Play GBNF Compiler for llama.cpp☆24Updated last year
- GPT-2 small trained on phi-like data☆66Updated last year
- Complex RAG backend☆28Updated last year
- The one who calls upon functions - Function-Calling Language Model☆36Updated last year
- ☆66Updated 10 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 6 months ago
- ☆38Updated last year
- Let's create synthetic textbooks together :)☆74Updated last year
- ☆31Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆86Updated last month
- Generate Structured JSON with probs from Language Models☆16Updated 3 weeks ago
- run ollama & gguf easily with a single command☆50Updated 10 months ago
- entropix style sampling + GUI☆25Updated 5 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆33Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- ☆112Updated 3 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 5 months ago
- Embed anything.☆29Updated 10 months ago
- ☆55Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆75Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 7 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- Easily create LLM automation/agent workflows☆59Updated last year
- ☆20Updated last year
- autologic is a Python package that implements the SELF-DISCOVER framework proposed in the paper SELF-DISCOVER: Large Language Models Self…☆57Updated last year
- ☆153Updated 8 months ago
- One Repo To Quickly Build One Docker File for HuggingChat Front and BackEnd☆26Updated last year
- A fast batching API to serve LLM models☆183Updated 11 months ago