adrienbrault / json-schema-to-gbnf
Converts JSON-Schema to GBNF grammar to use with llama.cpp
☆52Updated last year
Alternatives and similar repositories for json-schema-to-gbnf:
Users that are interested in json-schema-to-gbnf are comparing it to the libraries listed below
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆135Updated 9 months ago
- Generates grammer files from typescript for LLM generation☆37Updated last year
- Plug n Play GBNF Compiler for llama.cpp☆25Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆66Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- ☆30Updated last year
- Embedding models from Jina AI☆59Updated last year
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated last year
- hnsqlite integrates hnswlib and sqlite for simple text embedding search☆159Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆47Updated 9 months ago
- Using Large Language Models for Repo-wide Type Prediction☆109Updated last year
- A library for working with GBNF files☆21Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- Extensible AI assistant platform that bridges LLMs to tasks and actions☆38Updated 2 years ago
- ☆35Updated 2 years ago
- LLM plugin for clustering embeddings☆75Updated last year
- utilities for loading and running text embeddings with onnx☆44Updated 9 months ago
- Simple repo that compiles and runs llama2.c on the Web☆54Updated last year
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆48Updated last year
- Run GGML models with Kubernetes.☆173Updated last year
- Port of Facebook's LLaMA model in C/C++☆45Updated last year
- Chat Markup Language conversation library☆55Updated last year
- ☆112Updated 3 months ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆40Updated last year
- Local Startup Advisor Chatbot☆31Updated last year
- ☆135Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- Command-line script for inferencing from models such as falcon-7b-instruct☆76Updated last year
- WebGPU LLM inference tuned by hand☆149Updated last year
- Use context-free grammars with an LLM☆168Updated last year