hyparam / hyllama
llama.cpp gguf file parser for javascript
β30Updated last month
Alternatives and similar repositories for hyllama:
Users that are interested in hyllama are comparing it to the libraries listed below
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.β44Updated 5 months ago
- Chrome Extension for exploring Hugging Face datasets πβ49Updated 3 months ago
- One Line To Build Zero-Data Classifiers in Minutesβ33Updated 3 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Largβ¦β17Updated 2 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBreadβ18Updated 9 months ago
- A simple library for working with Hugging Face models.β14Updated 2 weeks ago
- implementation of https://arxiv.org/pdf/2312.09299β20Updated 6 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β61Updated this week
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.shβ48Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API includedβ14Updated 3 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Modelsβ44Updated last year
- β38Updated 10 months ago
- utilities for loading and running text embeddings with onnxβ40Updated 5 months ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE modelβ17Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLXβ15Updated 2 months ago
- ANE accelerated embedding models!β14Updated last month
- Embedding models from Jina AIβ57Updated last year
- Web browser version of StarCoder.cppβ43Updated last year
- GGML implementation of BERT model with Python bindings and quantization.β52Updated 10 months ago
- wasm bindings for huggingface tokenizers libraryβ35Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc.β66Updated last year
- Exploration of Vector database Index for fast approximate nearest neighbour search.β16Updated 5 months ago
- Tokun to can tokensβ15Updated 2 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ49Updated 10 months ago
- β21Updated 7 months ago
- β35Updated last month
- Using modal.com to process FineWeb-edu dataβ19Updated last month
- Nexusflow function call, tool use, and agent benchmarks.β18Updated last month
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate codeβ44Updated last year
- G2Pβ20Updated this week