nova-land / gbnf-compiler
Plug n Play GBNF Compiler for llama.cpp
☆25Updated last year
Alternatives and similar repositories for gbnf-compiler:
Users that are interested in gbnf-compiler are comparing it to the libraries listed below
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- ☆31Updated last year
- Experimental LLM Inference UX to aid in creative writing☆116Updated 4 months ago
- Experimental sampler to make LLMs more creative☆31Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- ☆35Updated 2 years ago
- ☆38Updated last year
- Generates grammer files from typescript for LLM generation☆37Updated last year
- ☆38Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Updated 10 months ago
- GPT-2 small trained on phi-like data☆66Updated last year
- Embed anything.☆29Updated 11 months ago
- One Line To Build Zero-Data Classifiers in Minutes☆53Updated 7 months ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆75Updated last year
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- ☆22Updated 11 months ago
- ☆66Updated 11 months ago
- PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t…☆29Updated 11 months ago
- large language model for mastering data analysis using pandas☆47Updated last year
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code☆44Updated last year
- The one who calls upon functions - Function-Calling Language Model☆36Updated last year
- ☆17Updated 4 months ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- utilities for loading and running text embeddings with onnx☆44Updated 9 months ago
- Chat Markup Language conversation library☆55Updated last year
- ☆130Updated last week
- Using modal.com to process FineWeb-edu data☆20Updated last month
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆78Updated 4 months ago
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated last year