IntrinsicLabsAI / gbnfgenLinks

TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces

☆140

Alternatives and similar repositories for gbnfgen

Users that are interested in gbnfgen are comparing it to the libraries listed below

Sorting:

IntrinsicLabsAI / grammar-builder
Generates grammer files from typescript for LLM generation
☆38Updated last year
belladoreai / llama-tokenizer-js
JS tokenizer for LLaMA 1 and 2
☆360Updated last year
adrienbrault / json-schema-to-gbnf
Converts JSON-Schema to GBNF grammar to use with llama.cpp
☆55Updated last year
FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆65Updated last year
ahoylabs / gguf.js
A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.
☆50Updated last year
kayvr / token-hawk
WebGPU LLM inference tuned by hand
☆150Updated 2 years ago
do-me / SemanticFinder
SemanticFinder - frontend-only live semantic search with transformers.js
☆300Updated 6 months ago
danielgross / ggml-k8s
Run GGML models with Kubernetes.
☆173Updated last year
tangledgroup / llama-cpp-wasm
WebAssembly (Wasm) Build and Bindings for llama.cpp
☆283Updated last year
togethercomputer / redpajama.cpp
Extend the original llama.cpp repo to support redpajama model.
☆118Updated last year
pulzeai-oss / knn-router
☆112Updated last year
QuixiAI / OpenChatML
☆162Updated 2 months ago
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆99Updated 2 years ago
automorphic-ai / trex
Enforce structured output from LLMs 100% of the time
☆248Updated last year
abacaj / openhermes-function-calling
☆134Updated last year
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated 2 years ago
ggml-org / p1
LLM-based code completion engine
☆190Updated 9 months ago
FL33TW00D / laserbeak
Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU
☆104Updated 2 years ago
newhouseb / clownfish
Constrained Decoding for LLMs against JSON Schema
☆329Updated 2 years ago
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆93Updated last month
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆118Updated last year
cmp-nct / ggllm.cpp
Falcon LLM ggml framework with CPU and GPU support
☆247Updated last year
nova-land / gbnf-compiler
Plug n Play GBNF Compiler for llama.cpp
☆28Updated last year
bananaml / potassium
An HTTP serving framework by Banana
☆101Updated last year
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆221Updated last year
Maximilian-Winter / llama_cpp_function_calling
☆31Updated last year
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆108Updated 2 years ago
lastmile-ai / llama-retrieval-plugin
LLaMa retrieval plugin script using OpenAI's retrieval plugin
☆323Updated 2 years ago
r2d4 / rellm
Exact structure out of any language model completion.
☆513Updated 2 years ago
LLukas22 / llm-rs-python
Unofficial python bindings for the rust llm library. 🐍❤️🦀
☆76Updated 2 years ago