cztomsik / ggml-js
JavaScript bindings for the ggml library
☆40 · Updated last year
Alternatives and similar repositories for ggml-js:
Users who are interested in ggml-js are comparing it to the libraries listed below.
- A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files. ☆47 · Updated 7 months ago
- WebGPU LLM inference tuned by hand ☆149 · Updated last year
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆102 · Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆136 · Updated 8 months ago
- GPU-accelerated client-side embeddings for vector search, RAG, etc. ☆66 · Updated last year
- Node.js implementation binding for the RWKV.cpp module ☆20 · Updated last year
- Llama 2 inference in one file of pure JavaScript (HTML) ☆31 · Updated 8 months ago
- tinygrad port of the RWKV large language model. ☆44 · Updated this week
- BlinkDL's RWKV-v4 running in the browser ☆47 · Updated 2 years ago
- GPT-2 small trained on phi-like data ☆65 · Updated last year
- Iterate quickly with llama.cpp hot reloading. Use the llama.cpp bindings with bun.sh ☆48 · Updated last year
- WebAssembly binding for the Hora Approximate Nearest Neighbor Search library ☆55 · Updated 3 years ago
- Full finetuning of large language models without large memory requirements ☆93 · Updated last year
- Train your own small bitnet model ☆65 · Updated 4 months ago
- Structured inference with Llama 2 in your browser ☆52 · Updated 4 months ago
- Vision Transformer (ViT) inference in plain C/C++ with ggml ☆30 · Updated last year
- A JavaScript implementation of Llama 3 using node-mlx. ☆72 · Updated 7 months ago
- Trying to make WebGPU a bit easier to use ☆16 · Updated last year
- LLM-based code completion engine ☆181 · Updated last month
- A library for incremental loading of large PyTorch checkpoints ☆56 · Updated 2 years ago
- Machine learning framework for Node.js. ☆191 · Updated 3 weeks ago
- A converter and basic tester for RWKV ONNX ☆42 · Updated last year
- Command-line script for inferencing from models such as falcon-7b-instruct ☆76 · Updated last year
- Implementation of the RWKV language model in pure WebGPU/Rust. ☆294 · Updated this week
- Run large language models (LLMs) 🚀 directly in your browser! ☆186 · Updated 6 months ago
- Python bindings for ggml ☆140 · Updated 6 months ago
- A highly customizable, full-scale web backend for web-rwkv, built on axum with the WebSocket protocol. ☆26 · Updated 10 months ago