ggml-org / llama.vscode
VSCode extension for LLM-assisted code/text completion
☆25 Updated last week
Alternatives and similar repositories for llama.vscode:
Users who are interested in llama.vscode are comparing it to the libraries listed below.
- Iterate quickly with llama.cpp hot reloading. Use the llama.cpp bindings with bun.sh ☆48 Updated last year
- Stable Diffusion in pure C/C++ ☆60 Updated last year
- Port of Suno AI's Bark in C/C++ for fast inference ☆55 Updated 9 months ago
- GGML implementation of BERT model with Python bindings and quantization. ☆52 Updated 10 months ago
- Work-in-progress vector search SQLite extension that runs anywhere. ☆9 Updated 5 months ago
- A faithful clone of Karpathy's llama2.c (one file inference, zero dependency) but fully functional with LLaMA 3 8B base and instruct mode… ☆113 Updated 5 months ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). … ☆43 Updated 3 months ago
- ☆51 Updated 6 months ago
- ☆53 Updated 4 months ago
- A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support ☆14 Updated 4 years ago
- emoji_finder ☆15 Updated 3 weeks ago
- Course Project for COMP4471 on RWKV ☆16 Updated 11 months ago
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp ☆43 Updated 8 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates ☆96 Updated this week
- ☆104 Updated 6 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml ☆88 Updated 10 months ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml ☆31 Updated last year
- Light WebUI for lm.rs ☆22 Updated 3 months ago
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva… ☆93 Updated last year
- Run X86 binary applications and libraries in the browser ☆35 Updated last month
- Inference Llama 2 in pure Zig ☆43 Updated last year
- Web browser version of StarCoder.cpp ☆43 Updated last year
- GGML implementation of BERT model with Python bindings and quantization. ☆27 Updated 11 months ago
- ☆27 Updated 3 weeks ago
- Minimalist stable-diffusion desktop application with only one executable file, written in Golang (no Python). ☆18 Updated last week
- Inference Llama 2 in one file of pure JavaScript (HTML) ☆30 Updated 6 months ago
- Temporal noise reduction for videos in BRAW format to get long virtual exposure time ☆29 Updated 7 months ago
- Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation ☆257 Updated last year
- tinygrad port of the RWKV large language model. ☆44 Updated 7 months ago
- Viznut's C-only GPT-2 implementation ☆50 Updated 2 years ago