hyparam / hyllama
llama.cpp gguf file parser for javascript
☆27 Updated 5 months ago
Related projects
Alternatives and complementary repositories for hyllama
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files. ☆45 Updated 3 months ago
- utilities for loading and running text embeddings with onnx ☆39 Updated 3 months ago
- A simple library for working with Hugging Face models. ☆15 Updated 2 months ago
- One Line To Build Zero-Data Classifiers in Minutes ☆33 Updated last month
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆48 Updated 2 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆44 Updated last year
- implementation of https://arxiv.org/pdf/2312.09299 ☆19 Updated 4 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆63 Updated 11 months ago
- assign color hues to a collection of text fragments based on embeddings ☆20 Updated 5 months ago
- Using modal.com to process FineWeb-edu data ☆19 Updated 2 months ago
- ☆31 Updated 4 months ago
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of … ☆21 Updated last month
- Public reports detailing responses to sets of prompts by Large Language Models. ☆26 Updated last year
- ☆29 Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆22 Updated last month
- Converts JSON-Schema to GBNF grammar to use with llama.cpp ☆51 Updated 11 months ago
- LLM plugin for embeddings using sentence-transformers ☆43 Updated 9 months ago
- Embedding models from Jina AI ☆56 Updated 10 months ago
- ☆22 Updated last year
- ☆18 Updated this week
- Generates grammar files from typescript for LLM generation ☆34 Updated 9 months ago
- A live multiplayer trivia game where users can bid for the subject of the next question ☆22 Updated 2 weeks ago
- ☆21 Updated 5 months ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh ☆46 Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 Updated 10 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆23 Updated last week
- Plug n Play GBNF Compiler for llama.cpp ☆19 Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations ☆33 Updated last year
- ☆38 Updated 8 months ago
- RWKV-7: Surpassing GPT ☆45 Updated this week