richardanaya / gbnf
A library for working with GBNF files
☆27 Updated 2 months ago
Alternatives and similar repositories for gbnf
Users interested in gbnf are comparing it to the libraries listed below.
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc. ☆64 Updated 2 years ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated last year
- Run AI models anywhere. https://muna.ai/explore ☆74 Updated last week
- Light WebUI for lm.rs ☆24 Updated last year
- ☆32 Updated 2 years ago
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manual ☆22 Updated 2 years ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀 ☆76 Updated 2 years ago
- fast state-of-the-art speech models and a runtime that runs anywhere 💥 ☆57 Updated 6 months ago
- Converts JSON-Schema to GBNF grammar to use with llama.cpp ☆55 Updated 2 years ago
- Easily create LLM automation/agent workflows ☆60 Updated last year
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp ☆51 Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆65 Updated 2 years ago
- Using modal.com to process FineWeb-edu data ☆20 Updated 9 months ago
- LLM plugin for clustering embeddings ☆82 Updated last year
- ☆10 Updated 2 years ago
- Plug n Play GBNF Compiler for llama.cpp ☆28 Updated 2 years ago
- A guidance compatibility layer for llama-cpp-python ☆36 Updated 2 years ago
- Embedding models from Jina AI ☆65 Updated last year
- Official Rust Implementation of Model2Vec ☆145 Updated 3 months ago
- A CLI in Rust to generate synthetic data for MLX friendly training ☆25 Updated last year
- A web-app to explore topics using LLM (less typing and more clicks) ☆67 Updated last year
- A high performance batching router optimises max throughput for text inference workload ☆16 Updated 2 years ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆52 Updated last year
- Tools for formatting large language model prompts. ☆13 Updated 2 years ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU ☆13 Updated last year
- Claudetools is a Python library that enables function calling with the Claude 3 family of language models from Anthropic. ☆38 Updated 11 months ago
- Proxy server for triton gRPC server that inferences embedding model in Rust ☆21 Updated last year
- AirLLM 70B inference with single 4GB GPU ☆14 Updated 6 months ago
- Chat Markup Language conversation library ☆55 Updated 2 years ago
- powerful and fast tool calling agents ☆79 Updated 9 months ago