ggml-org / llama.vscode

VSCode extension for LLM-assisted code/text completion

☆25

Alternatives and similar repositories for llama.vscode:

Users that are interested in llama.vscode are comparing it to the libraries listed below

spirobel / bunny-llama
iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh
☆48Updated last year
ggerganov / stable-diffusion.cpp
Stable Diffusion in pure C/C++
☆60Updated last year
ggerganov / bark.cpp
Port of Suno AI's Bark in C/C++ for fast inference
☆55Updated 9 months ago
iamlemec / bert.cpp
GGML implementation of BERT model with Python bindings and quantization.
☆52Updated 10 months ago
jart / sqlite-vec
Work-in-progress vector search SQLite extension that runs anywhere.
☆9Updated 5 months ago
jameswdelancey / llama3.c
A faithful clone of Karpathy's llama2.c (one file inference, zero dependency) but fully functional with LLaMA 3 8B base and instruct mode…
☆113Updated 5 months ago
nomic-ai / kompute
General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …
☆43Updated 3 months ago
spectral-compute / scale-examples
☆51Updated 6 months ago
cjpais / whisperfile
☆53Updated 4 months ago
Linaro / tinyBLAS
A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support
☆14Updated 4 years ago
astrowonk / emoji_finder
emoji_finder
☆15Updated 3 weeks ago
lukasVierling / FaceRWKV
Course Project for COMP4471 on RWKV
☆16Updated 11 months ago
distantmagic / structured
Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp
☆43Updated 8 months ago
google / minja
A minimalistic C++ Jinja templating engine for LLM chat templates
☆96Updated this week
futo-org / whisper-acft
☆104Updated 6 months ago
PABannier / biogpt.cpp
Port of Microsoft's BioGPT in C/C++ using ggml
☆88Updated 10 months ago
ggerganov / vit.cpp
Inference Vision Transformer (ViT) in plain C/C++ with ggml
☆31Updated last year
samuel-vitorino / lm.rs-webui
Light WebUI for lm.rs
☆22Updated 3 months ago
KerfuffleV2 / smolrsrwkv
A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…
☆93Updated last year
leaningtech / cheerpx-meta
Run X86 binary applications and libraries in the browser
☆35Updated last month
clebert / llama2.zig
Inference Llama 2 in pure Zig
☆43Updated last year
rahuldshetty / starcoder.js
Web browser version of StarCoder.cpp
☆43Updated last year
ggerganov / bert.cpp
GGML implementation of BERT model with Python bindings and quantization.
☆27Updated 11 months ago
mrconter1 / human-level-agi-definition
☆27Updated 3 weeks ago
Cyberhan123 / stable-diffusion-desktop
Minimalist stable-diffusion desktop application with only one executable file writen with golang ( No python ).
☆18Updated last week
epicure / llama2.js
Inference Llama 2 in one file of pure JavaScript(HTML)
☆30Updated 6 months ago
hackyourlife / brawshot
Temporal noise reduction for videos in BRAW format to get long virtual exposure time
☆29Updated 7 months ago
symisc / tiny-dream
Tiny Dream - An embedded, Header Only, Stable Diffusion C++ implementation
☆257Updated last year
wozeparrot / tinyrwkv
tinygrad port of the RWKV large language model.
☆44Updated 7 months ago
viznut / vzgpt
Viznut's C-only GPT-2 implementation
☆50Updated 2 years ago