nova-land / gbnf-compiler
Plug n Play GBNF Compiler for llama.cpp
☆28 · Updated 2 years ago
Alternatives and similar repositories for gbnf-compiler
Users who are interested in gbnf-compiler are comparing it to the libraries listed below.
- A guidance compatibility layer for llama-cpp-python ☆36 · Updated 2 years ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀 ☆76 · Updated 2 years ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆43 · Updated last year
- A pythonic library providing light-weighted interface with LLMs ☆130 · Updated 6 months ago
- GPT-2 small trained on phi-like data ☆67 · Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆25 · Updated last year
- Experimental LLM Inference UX to aid in creative writing ☆127 · Updated last year
- ☆32 · Updated last year
- ☆68 · Updated last year
- Experimental sampler to make LLMs more creative ☆31 · Updated 2 years ago
- Simple Graph Memory for AI applications ☆89 · Updated 6 months ago
- ☆164 · Updated 4 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX. ☆99 · Updated 5 months ago
- The one who calls upon functions - Function-Calling Language Model ☆36 · Updated 2 years ago
- Generates grammar files from typescript for LLM generation ☆38 · Updated last year
- Verbosity control for AI agents ☆64 · Updated last year
- ☆117 · Updated 11 months ago
- ☆54 · Updated 2 years ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆45 · Updated last year
- Embedding models from Jina AI ☆65 · Updated last year
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline. ☆46 · Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆78 · Updated last year
- Client-side toolkit for using large language models, including where self-hosted ☆113 · Updated 2 weeks ago
- Efficient computer use agent powered by Meta Llama 4 Maverick ☆45 · Updated 7 months ago
- Replace expensive LLM calls with finetunes automatically ☆66 · Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models ☆179 · Updated last year
- GRDN.AI app for garden optimization ☆69 · Updated 3 weeks ago
- The code we currently use to fine-tune models. ☆117 · Updated last year
- utilities for loading and running text embeddings with onnx ☆44 · Updated 3 months ago
- Chat Markup Language conversation library ☆55 · Updated last year