nova-land / gbnf-compiler
Plug n Play GBNF Compiler for llama.cpp
☆24 · Updated last year
Alternatives and similar repositories for gbnf-compiler:
Users who are interested in gbnf-compiler are comparing it to the libraries listed below.
- A guidance compatibility layer for llama-cpp-python ☆34 · Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆67 · Updated 4 months ago
- A Python package designed to simplify the process of creating and managing function calls to OpenAI's API, as well as models using LiteLL… ☆14 · Updated 5 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆43 · Updated 11 months ago
- Experimental sampler to make LLMs more creative ☆30 · Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆59 · Updated 7 months ago
- Using modal.com to process FineWeb-edu data ☆20 · Updated last week
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆37 · Updated last year
- GPT-2 small trained on phi-like data ☆65 · Updated last year
- utilities for loading and running text embeddings with onnx ☆44 · Updated 7 months ago
- PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t… ☆29 · Updated 9 months ago
- Chat Markup Language conversation library ☆55 · Updated last year
- Embed anything. ☆29 · Updated 9 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆25 · Updated 4 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆46 · Updated 5 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆32 · Updated last year
- A Pythonic library providing a lightweight interface to LLMs ☆124 · Updated 4 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management. ☆22 · Updated 6 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆72 · Updated last week
- A Python package for serving LLMs on OpenAI-compatible API endpoints with prompt caching using MLX. ☆73 · Updated 3 months ago
- LMQL implementation of tree of thoughts ☆34 · Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 · Updated last year