FL33TW00D / wattkitLinks

☆14

Alternatives and similar repositories for wattkit

Users that are interested in wattkit are comparing it to the libraries listed below

Sorting:

spirobel / bunny-llama
iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh
☆50Updated 2 years ago
FL33TW00D / deCoreML
Find out why your CoreML model isn't running on the Neural Engine!
☆27Updated last year
smpanaro / ModernBERT-AppleNeuralEngine
ModernBERT model optimized for Apple Neural Engine.
☆28Updated 10 months ago
guidance-ai / llgtrt
TensorRT-LLM server with Structured Outputs (JSON) built with Rust
☆61Updated 7 months ago
Vaibhavs10 / fast-llm.rs
☆140Updated last year
jsgrad-org / jsgrad
jsgrad is a dependency-free ML library in Typescript for model inference and training with support to WebGPU and other runtimes.
☆60Updated 7 months ago
ngxson / ggml-easy
Thin wrapper around GGML to make life easier
☆40Updated 3 weeks ago
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58Updated last year
LaurentMazare / glim
☆19Updated last year
zanussbaum / surfgrad
webgpu autograd library
☆33Updated 6 months ago
FL33TW00D / laserbeak
Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU
☆106Updated 2 years ago
cloudflare / workers-wonnx
☆47Updated 7 months ago
ml-explore / mlx-c
C API for MLX
☆153Updated last week
Narsil / hf-chat
☆26Updated 11 months ago
m1guelpf / repair-json
Repair incomplete JSON (e.g. from streaming APIs or AI models) so it can be parsed as it's received.
☆36Updated last year
AmineDiro / docvec
Semantic search webassembly module
☆18Updated last year
firstbatchxyz / dria-sdk
Dria SDK is for building and executing synthetic data generation pipelines on Dria Knowledge Network.
☆29Updated 7 months ago
LaurentMazare / ug
Experimental compiler for deep learning models
☆71Updated 2 months ago
FL33TW00D / coremlprofiler
Profile your CoreML models directly from Python 🐍
☆29Updated 2 months ago
emmyoh / zebra
A vector database for querying meaningfully similar data.
☆16Updated 8 months ago
FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆65Updated last year
kayvr / token-hawk
WebGPU LLM inference tuned by hand
☆151Updated 2 years ago
danielgross / ggml-k8s
Run GGML models with Kubernetes.
☆175Updated last year
richardanaya / gbnf
A library for working with GBNF files
☆26Updated 3 weeks ago
LaurentMazare / mamba.rs
☆135Updated last year
okuvshynov / llama_duo
asynchronous/distributed speculative evaluation for llama3
☆39Updated last year
PrimeIntellect-ai / toploc
TOPLOC: is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurat…
☆48Updated 7 months ago
MacPaw / macapptree
Repository for macos accessibility parser
☆22Updated last month
kyutai-labs / moshi-swift
☆123Updated 5 months ago
doomslide / baby-compiler
It's a baby compiler. (Lean btw.)
☆16Updated 6 months ago