nova-land / gbnf-compiler
Plug n Play GBNF Compiler for llama.cpp
☆24Updated last year
Alternatives and similar repositories for gbnf-compiler:
Users that are interested in gbnf-compiler are comparing it to the libraries listed below
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- ☆38Updated last year
- LMQL implementation of tree of thoughts☆34Updated last year
- ☆31Updated last year
- Experimental sampler to make LLMs more creative☆30Updated last year
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆76Updated 4 months ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆75Updated last year
- ☆66Updated 10 months ago
- Embed anything.☆29Updated 10 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- Generates grammer files from typescript for LLM generation☆37Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆60Updated 8 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- Chat Markup Language conversation library☆55Updated last year
- ☆22Updated 10 months ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆53Updated last year
- GPT-2 small trained on phi-like data☆66Updated last year
- Python console application designed to provide an engaging and visually appealing LLM chat experience on Unix-like consoles or Terminals.☆22Updated 9 months ago
- Very minimal (and stateless) agent framework☆41Updated 3 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆86Updated last month
- Using modal.com to process FineWeb-edu data☆20Updated last week
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆33Updated last year
- GenAI & agent toolkit for Apple Silicon Mac, implementing JSON schema-steered structured output (3SO) and tool-calling in Python. For mor…☆121Updated last month
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 6 months ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆49Updated 11 months ago
- ☆38Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆66Updated last year
- One Line To Build Zero-Data Classifiers in Minutes☆36Updated 6 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 11 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 5 months ago