cztomsik / ggml-js
JavaScript bindings for the ggml library
☆43 Updated 3 weeks ago
Alternatives and similar repositories for ggml-js:
Users who are interested in ggml-js are comparing it to the libraries listed below:
- tinygrad port of the RWKV large language model. ☆44 Updated last month
- ☆40 Updated 2 years ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆102 Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆135 Updated 9 months ago
- WebGPU LLM inference tuned by hand ☆149 Updated last year
- Node.js implementation binding for the RWKV.cpp module ☆20 Updated last year
- Inference Llama 2 in one file of pure JavaScript(HTML) ☆33 Updated 9 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files. ☆47 Updated 8 months ago
- BlinkDL's RWKV-v4 running in the browser ☆47 Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀 ☆75 Updated last year
- A JavaScript implementation of Llama 3 using node-mlx. ☆72 Updated 9 months ago
- LLM-based code completion engine ☆181 Updated 2 months ago
- ☆13 Updated last year
- trying to make WebGPU a bit easier to use ☆16 Updated last year
- Llama2 inference in one TypeScript file ☆17 Updated 10 months ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh ☆48 Updated last year
- webassembly binding for Hora Approximate Nearest Neighbor Search Library ☆57 Updated 3 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml ☆30 Updated last year
- Embeddings focused small version of Llama NLP model ☆103 Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this… ☆20 Updated 2 years ago
- Chatbot that answers frequently asked questions in French, English, and Tunisian using the Rasa NLU framework and RWKV-4-Raven ☆13 Updated last year
- Implementation of the RWKV language model in pure WebGPU/Rust. ☆298 Updated last week
- Full finetuning of large language models without large memory requirements ☆94 Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat ☆101 Updated last year
- Port of Suno AI's Bark in C/C++ for fast inference ☆52 Updated last year
- hnswlib-wasm attempts to create a browser friendly version of hnswlib ☆45 Updated last year
- A JavaScript and TypeScript port of PyTorch C++ library (libtorch) - Node.js N-API bindings for libtorch. ☆16 Updated 2 years ago
- Port of Microsoft's BioGPT in C/C++ using ggml ☆88 Updated last year
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model ☆40 Updated last year