hyparam / hyllama
llama.cpp GGUF file parser for JavaScript
☆50Updated last year
Alternatives and similar repositories for hyllama
Users that are interested in hyllama are comparing it to the libraries listed below
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆51Updated last year
- Embedding models from Jina AI☆65Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎☆49Updated last year
- Generates grammar files from TypeScript for LLM generation☆38Updated last year
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆44Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago
- Parallel wasm Barnes-Hut t-SNE implementation written in Rust.☆21Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆141Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated 2 years ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Updated 9 months ago
- Using modal.com to process FineWeb-edu data☆20Updated 8 months ago
- utilities for loading and running text embeddings with onnx☆44Updated 4 months ago
- Browse, search, and visualize ONNX models.☆34Updated 7 months ago
- Using embeddings compressed by Product Quantization, in Javascript☆31Updated 2 years ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆50Updated 2 years ago
- Website with current metrics on the fastest AI models.☆42Updated last year
- Hyperparam local dataset viewer☆26Updated this week
- A guidance compatibility layer for llama-cpp-python☆36Updated 2 years ago
- ☆38Updated last year
- Generate BM25 sparse vector inside PostgreSQL☆87Updated last year
- Simple LLM inference server☆20Updated last year
- kokoro text to speech using javascript☆63Updated 10 months ago
- LLM plugin for clustering embeddings☆82Updated last year
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser?☆109Updated 8 months ago
- Chat Markup Language conversation library☆55Updated last year
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Updated 2 years ago
- ☆62Updated last year
- Structured inference with Llama 2 in your browser☆53Updated last year
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆57Updated 6 months ago
- Benchmarking suite for popular AI APIs☆89Updated 10 months ago