rahuldshetty / starcoder.jsLinks
Web browser version of StarCoder.cpp
☆46Updated 2 years ago
Alternatives and similar repositories for starcoder.js
Users that are interested in starcoder.js are comparing it to the libraries listed below
Sorting:
- GGML implementation of BERT model with Python bindings and quantization.☆58Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml☆85Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- Experiments with BitNet inference on CPU☆55Updated last year
- tinygrad port of the RWKV large language model.☆45Updated 10 months ago
- Fast inference of Instruct tuned LLaMa on your personal devices.☆23Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago
- Embeddings focused small version of Llama NLP model☆107Updated 2 years ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆51Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat☆100Updated 2 years ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆50Updated 2 years ago
- Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…☆19Updated last year
- ☆11Updated 2 years ago
- A super simple web interface to perform blind tests on LLM outputs.☆29Updated last year
- Inference of Mamba and Mamba2 models in pure C☆196Updated 2 weeks ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated 2 years ago
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated 2 years ago
- Drop in replacement for OpenAI, but with Open models.☆154Updated 2 years ago
- LLaVA server (llama.cpp).☆183Updated 2 years ago
- Download full or partial git-lfs repos without temporarily using 2x disk space☆30Updated 2 years ago
- ☆62Updated last year
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45Updated last year
- ☆26Updated 2 years ago
- BlinkDL's RWKV-v4 running in the browser☆48Updated 2 years ago
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast☆150Updated last year
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Updated 2 years ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆51Updated 11 months ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆141Updated last year
- ☆35Updated 2 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆31Updated 2 years ago