cztomsik / ggml-js
JavaScript bindings for the ggml-js library
☆41Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for ggml-js
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆45Updated 3 months ago
- BlinkDL's RWKV-v4 running in the browser☆46Updated last year
- ☆40Updated last year
- WebGPU LLM inference tuned by hand☆147Updated last year
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated last year
- Inference Llama 2 in one file of pure JavaScript(HTML)☆30Updated 4 months ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆131Updated 4 months ago
- ☆31Updated 10 months ago
- tinygrad port of the RWKV large language model.☆43Updated 5 months ago
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…☆92Updated last year
- GPT-2 small trained on phi-like data☆65Updated 9 months ago
- Generates grammer files from typescript for LLM generation☆34Updated 9 months ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆40Updated last year
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆20Updated last year
- Train your own small bitnet model☆56Updated last month
- Full finetuning of large language models without large memory requirements☆93Updated 10 months ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat☆102Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆63Updated 11 months ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆73Updated last year
- Node.js implementation binding for the RWKV.cpp module☆20Updated last year
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated last year
- ☆13Updated last year
- Port of Suno AI's Bark in C/C++ for fast inference☆54Updated 7 months ago
- Run Large-Language Models (LLMs) 🚀 directly in your browser!☆168Updated 2 months ago
- A JavaScript implementation of Llama 3 using node-mlx.☆69Updated 4 months ago
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆46Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆124Updated last year