richardanaya / gbnfLinks
A library for working with GBNF files
☆27Updated last month
Alternatives and similar repositories for gbnf
Users that are interested in gbnf are comparing it to the libraries listed below
Sorting:
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- Run AI models anywhere. https://muna.ai/explore☆71Updated this week
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Updated 2 years ago
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manual☆22Updated 2 years ago
- Light WebUI for lm.rs☆24Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆67Updated last year
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc.☆62Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆76Updated 2 years ago
- utilities for loading and running text embeddings with onnx☆44Updated 3 months ago
- A modular framework for building massively parallel agentic systems☆29Updated 3 months ago
- LLM plugin for clustering embeddings☆82Updated last year
- Plug n Play GBNF Compiler for llama.cpp☆28Updated 2 years ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆51Updated last year
- A daemon that makes a desktop OS accessible to AI agents☆36Updated 6 months ago
- ☆59Updated 8 months ago
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆44Updated last year
- Run embedding models using ONNX☆35Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 8 months ago
- A guidance compatibility layer for llama-cpp-python☆36Updated 2 years ago
- ☆32Updated last year
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆51Updated 2 years ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- ☆10Updated 2 years ago
- Create an LLM XML context document from an llms.txt file☆23Updated last year
- A high performance batching router optimises max throughput for text inference workload☆16Updated 2 years ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- Embedding models from Jina AI☆65Updated last year
- Official servlets for mcp.run published by @dylibso☆64Updated 3 weeks ago
- Easily create LLM automation/agent workflows☆60Updated last year