cztomsik / ggml-js
JavaScript bindings for the ggml library
☆41 Updated last year
Alternatives and similar repositories for ggml-js:
Users who are interested in ggml-js are comparing it to the libraries listed below.
- A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files. ☆47 Updated 7 months ago
- tinygrad port of the RWKV large language model. ☆44 Updated 2 weeks ago
- ☆40 Updated last year
- BlinkDL's RWKV-v4 running in the browser ☆47 Updated 2 years ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆102 Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 Updated last year
- Node.js implementation binding for the RWKV.cpp module ☆20 Updated last year
- WebGPU LLM inference tuned by hand ☆149 Updated last year
- Iterate quickly with llama.cpp hot reloading. Use the llama.cpp bindings with bun.sh ☆48 Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆135 Updated 8 months ago
- Implementation of the RWKV language model in pure WebGPU/Rust. ☆295 Updated this week
- Inference Llama 2 in one file of pure JavaScript (HTML) ☆32 Updated 8 months ago
- ☆13 Updated last year
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva… ☆93 Updated last year
- LLM-based code completion engine ☆181 Updated 2 months ago
- Unofficial Python bindings for the Rust llm library. 🐍❤️🦀 ☆75 Updated last year
- Full finetuning of large language models without large memory requirements ☆93 Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this… ☆20 Updated 2 years ago
- GPT-2 small trained on phi-like data ☆65 Updated last year
- ☆31 Updated last year
- Port of Suno AI's Bark in C/C++ for fast inference ☆53 Updated 11 months ago
- Framework agnostic python runtime for RWKV models ☆145 Updated last year
- Extend the original llama.cpp repo to support redpajama model. ☆117 Updated 6 months ago
- GGML implementation of BERT model with Python bindings and quantization. ☆56 Updated last year
- ☆16 Updated last year
- A library for incremental loading of large PyTorch checkpoints ☆56 Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat ☆101 Updated last year
- Generates grammar files from TypeScript for LLM generation ☆37 Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 Updated last year
- WebAssembly (Wasm) Build and Bindings for llama.cpp ☆246 Updated 8 months ago