WebGPU LLM inference tuned by hand
☆150 · Jun 24, 2023 · Updated 2 years ago
Alternatives and similar repositories for token-hawk
Users who are interested in token-hawk are comparing it to the libraries listed below.
- trying to make WebGPU a bit easier to use ☆19 · Jan 9, 2024 · Updated 2 years ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio ☆38 · Jun 6, 2023 · Updated 2 years ago
- ggml implementation of BERT ☆498 · Feb 23, 2024 · Updated 2 years ago
- A blueprint for next-gen AI. Project Infinity uses a token-efficient, Codified Agent Protocol to create specialized, secure, and imaginat… ☆26 · Oct 2, 2025 · Updated 5 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆45 · Jun 13, 2023 · Updated 2 years ago
- A Next.js chat app to use Llama 2 locally using node-llama-cpp ☆12 · Oct 27, 2024 · Updated last year
- A distributed execution framework built upon lunatic. ☆16 · Jan 19, 2024 · Updated 2 years ago
- ☆11 · Oct 11, 2023 · Updated 2 years ago
- A lightweight Python utility that aggregates and exports comprehensive system information to JSON, specifically designed for feeding syst… ☆13 · Apr 13, 2025 · Updated 10 months ago
- A Swift package for interacting with selenium and undetected-chromedriver through python by using PythonKit. ☆13 · Jun 21, 2025 · Updated 8 months ago
- Erudito: Easy API/CLI to ask questions about your documentation ☆99 · Nov 6, 2023 · Updated 2 years ago
- Rust bindings for CTranslate2 ☆14 · Jun 21, 2023 · Updated 2 years ago
- Makes llama.cpp easy to use. ☆12 · May 14, 2025 · Updated 9 months ago
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP. ☆15 · May 3, 2021 · Updated 4 years ago
- A repo to hold some simple experiments ☆14 · May 4, 2022 · Updated 3 years ago
- A torchless, c++ rwkv implementation using 8bit quantization, written in cuda/hip/vulkan for maximum compatibility and minimum dependenci… ☆312 · Jan 31, 2024 · Updated 2 years ago
- A guidance language for controlling large language models. ☆45 · Jun 9, 2023 · Updated 2 years ago
- Builds Dawn on Linux and macOS as one single easier-to-use library ☆29 · Dec 5, 2021 · Updated 4 years ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation ☆857 · Nov 16, 2024 · Updated last year
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model ☆1,564 · Mar 23, 2025 · Updated 11 months ago
- [Oral; NeurIPS OPT2024] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers ☆15 · Feb 12, 2026 · Updated 3 weeks ago
- [WIP] Transformer to embed Danbooru labelsets ☆13 · Mar 31, 2024 · Updated last year
- ☆29 · Feb 27, 2024 · Updated 2 years ago
- Hardware-accelerated matrix/numeric programming library for Swift ☆12 · Sep 2, 2025 · Updated 6 months ago
- 🧪 Model Loader API ☆33 · Feb 26, 2024 · Updated 2 years ago
- Breathing Life into Machines ☆13 · Apr 24, 2024 · Updated last year
- A Next.js chatbot app demonstrating seamless integration with window.ai. ☆15 · Jun 25, 2023 · Updated 2 years ago
- Some RSA attacks with sage ☆11 · Nov 15, 2016 · Updated 9 years ago
- minimal diffusion transformer in pytorch. ☆17 · Oct 6, 2024 · Updated last year
- ☆14 · Jul 6, 2023 · Updated 2 years ago
- https://hf.co/hexgrad/Kokoro-82M ☆14 · Jan 14, 2026 · Updated last month
- A small stack-based audio language. ☆21 · Sep 10, 2024 · Updated last year
- C++ implementation for 💫StarCoder ☆459 · Sep 9, 2023 · Updated 2 years ago
- A cross-platform browser ML framework. ☆747 · Nov 23, 2024 · Updated last year
- A Personalised AI Assistant Inspired by 'Diamond Age', Powered by SMS ☆92 · Jun 1, 2023 · Updated 2 years ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆38 · Jun 21, 2024 · Updated last year
- Stupidly Simple Audio Streaming Library ☆17 · Sep 7, 2016 · Updated 9 years ago
- A sample pattern for running CI tests on Modal ☆19 · Apr 12, 2025 · Updated 10 months ago
- MPS like shaders for audio processing. Conv1d, Spectrogram. ☆18 · Apr 3, 2021 · Updated 4 years ago