adrienbrault / json-schema-to-gbnf
Converts JSON-Schema to GBNF grammar to use with llama.cpp
☆55 Updated 2 years ago
Alternatives and similar repositories for json-schema-to-gbnf
Users who are interested in json-schema-to-gbnf are comparing it to the libraries listed below
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆141 Updated last year
- Generates grammar files from TypeScript for LLM generation ☆38 Updated last year
- Embedding models from Jina AI ☆65 Updated last year
- A library for incremental loading of large PyTorch checkpoints ☆56 Updated 2 years ago
- Extend the original llama.cpp repo to support redpajama model. ☆118 Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆65 Updated 2 years ago
- ☆31 Updated 2 years ago
- LLM plugin for clustering embeddings ☆82 Updated last year
- hnsqlite integrates hnswlib and sqlite for simple text embedding search ☆161 Updated 2 years ago
- Use context-free grammars with an LLM ☆175 Updated last year
- A library for working with GBNF files ☆27 Updated last month
- Unofficial Python bindings for the Rust llm library. 🐍❤️🦀 ☆76 Updated 2 years ago
- WebGPU LLM inference tuned by hand ☆151 Updated 2 years ago
- Plug n Play GBNF Compiler for llama.cpp ☆28 Updated 2 years ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆107 Updated 2 years ago
- Drop in replacement for OpenAI, but with Open models. ☆153 Updated 2 years ago
- Chat Markup Language conversation library ☆55 Updated last year
- Command-line script for inferencing from models such as falcon-7b-instruct ☆75 Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆100 Updated 2 years ago
- utilities for loading and running text embeddings with onnx ☆44 Updated 4 months ago
- ⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW. ☆146 Updated last year
- Run GGML models with Kubernetes. ☆175 Updated 2 years ago
- Local Startup Advisor Chatbot ☆32 Updated last year
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh ☆50 Updated 2 years ago
- ☆114 Updated last year
- A JavaScript library (with TypeScript types) to parse metadata of GGML based GGUF files. ☆51 Updated last year
- Using Large Language Models for Repo-wide Type Prediction ☆112 Updated 2 years ago
- Enforce structured output from LLMs 100% of the time ☆248 Updated last year
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering. ☆54 Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆49 Updated last year