richardanaya / gbnf
A library for working with GBNF files
☆19Updated 8 months ago
Related projects: ⓘ
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manual☆22Updated 9 months ago
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc.☆52Updated 11 months ago
- ☆15Updated 6 months ago
- ☆31Updated 8 months ago
- Easily create LLM automation/agent workflows☆54Updated 7 months ago
- ☆28Updated this week
- Chrome Extension for exploring Hugging Face datasets 🔎☆48Updated last month
- A web-app to explore topics using LLM (less typing and more clicks)☆65Updated 8 months ago
- A guidance compatibility layer for llama-cpp-python☆35Updated last year
- ☆10Updated last year
- Jupyter notebooks for cloud-based usage☆10Updated last year
- Draft42 - Streamlit chatbot with function calling☆28Updated 5 months ago
- Embedding models from Jina AI☆55Updated 8 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆28Updated last week
- Simple LLM inference server☆16Updated 3 months ago
- A distributed agent orchestration framework for market agents☆17Updated this week
- utilities for loading and running text embeddings with onnx☆39Updated last month
- ☆18Updated 6 months ago
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆49Updated 9 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆36Updated 7 months ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆30Updated 3 months ago
- The fastest CLI tool for prompting LLMs. Including support for prompting several LLMs at once!☆54Updated 3 weeks ago
- A SQLite extension for generating text embeddings from remote APIs (OpenAI, Nomic, Ollama, llamafile...)☆64Updated 2 weeks ago
- jQuery, React and Streamlit applications written by LLMs☆16Updated 8 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- LLM plugin for models hosted by OpenRouter☆69Updated 4 months ago
- Proxy server for triton gRPC server that inferences embedding model in Rust☆16Updated last month
- Run embedding models using ONNX☆23Updated 7 months ago
- ☆36Updated 6 months ago
- Mistral-7B finetuned for function calling☆14Updated 7 months ago