nicholasyager / llama-cpp-guidance
A guidance compatibility layer for llama-cpp-python
☆34 · Updated last year
Alternatives and similar repositories for llama-cpp-guidance:
Users who are interested in llama-cpp-guidance compare it to the libraries listed below.
- Plug n Play GBNF Compiler for llama.cpp ☆25 · Updated last year
- Complex RAG backend ☆28 · Updated last year
- ☆66 · Updated 11 months ago
- ☆38 · Updated last year
- The one who calls upon functions - Function-Calling Language Model ☆36 · Updated last year
- ☆38 · Updated last year
- run ollama & gguf easily with a single command ☆50 · Updated 11 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications ☆28 · Updated 10 months ago
- ☆31 · Updated last year
- Embed anything. ☆29 · Updated 11 months ago
- entropix style sampling + GUI ☆26 · Updated 6 months ago
- GPT-2 small trained on phi-like data ☆66 · Updated last year
- Chat Markup Language conversation library ☆55 · Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆45 · Updated 7 months ago
- ☆73 · Updated last year
- A high-performance batching router that optimises max throughput for text inference workloads ☆16 · Updated last year
- Easily view and modify JSON datasets for large language models ☆75 · Updated 2 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 · Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 · Updated 11 months ago
- ☆112 · Updated 4 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆34 · Updated last year
- A framework for evaluating function calls made by LLMs ☆37 · Updated 9 months ago
- large language model for mastering data analysis using pandas ☆47 · Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆87 · Updated last week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆66 · Updated 6 months ago
- PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t… ☆29 · Updated 11 months ago
- Local LLM inference & management server with built-in OpenAI API ☆31 · Updated last year
- Simple, Fast, Parallel Huggingface GGML model downloader written in python ☆24 · Updated last year
- Using modal.com to process FineWeb-edu data ☆20 · Updated last month
- A web-app to explore topics using LLM (less typing and more clicks) ☆66 · Updated last year