richardanaya / gbnfLinks
A library for working with GBNF files
☆25Updated 3 weeks ago
Alternatives and similar repositories for gbnf
Users that are interested in gbnf are comparing it to the libraries listed below
Sorting:
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 10 months ago
- LLM plugin for clustering embeddings☆82Updated last year
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45Updated last year
- Light WebUI for lm.rs☆24Updated 11 months ago
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc.☆62Updated last year
- Run AI models anywhere. https://muna.ai/explore☆67Updated this week
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Updated last year
- A web-app to explore topics using LLM (less typing and more clicks)☆67Updated last year
- ☆16Updated last year
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism☆14Updated 2 years ago
- A modular framework for building massively parallel agentic systems☆29Updated 3 weeks ago
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manual☆22Updated last year
- A high performance batching router optimises max throughput for text inference workload☆16Updated 2 years ago
- Official Rust Implementation of Model2Vec☆138Updated last week
- Extract structured data from local or remote LLM models☆47Updated last year
- ☆31Updated last year
- ☆10Updated 2 years ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Updated last year
- ☆60Updated 5 months ago
- A guidance compatibility layer for llama-cpp-python☆36Updated 2 years ago
- Create an LLM XML context document from an llms.txt file☆22Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 6 months ago
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆56Updated 3 months ago
- Embedding models from Jina AI☆65Updated last year
- Chat Markup Language conversation library☆55Updated last year
- utilities for loading and running text embeddings with onnx☆44Updated last month
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated last year
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆95Updated 3 months ago
- ☆38Updated last year
- George is an API leveraging AI to make it easy to control a computer with natural language.☆50Updated 9 months ago