philpax / ggmlLinks
Tensor library for machine learning
☆21Updated last year
Alternatives and similar repositories for ggml
Users that are interested in ggml are comparing it to the libraries listed below
Sorting:
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆52Updated 7 months ago
- Light WebUI for lm.rs☆24Updated 11 months ago
- inference code for mixtral-8x7b-32kseqlen☆101Updated last year
- Transformer GPU VRAM estimator☆66Updated last year
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆112Updated last month
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".☆277Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆89Updated this week
- First token cutoff sampling inference example☆31Updated last year
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆73Updated 7 months ago
- Python bindings for ggml☆146Updated last year
- Inference Llama 2 in one file of pure C++☆83Updated 2 years ago
- A collection of all available inference solutions for the LLMs☆91Updated 6 months ago
- GGUF implementation in C as a library and a tools CLI program☆291Updated 3 weeks ago
- llama.cpp to PyTorch Converter☆34Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.