PrimeIntellect-ai / pi-quantLinks

SIMD quantization kernels

☆76

Alternatives and similar repositories for pi-quant

Users that are interested in pi-quant are comparing it to the libraries listed below

Sorting:

xjdr-alt / llmri
look how they massacred my boy
☆63Updated 9 months ago
PrimeIntellect-ai / pccl
PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP
☆98Updated 2 weeks ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆138Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103Updated 4 months ago
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆150Updated 9 months ago
PrimeIntellect-ai / prime-vllm
Modded vLLM to run pipeline parallelism over public networks
☆37Updated 2 months ago
xjdr-alt / entropix-trainer
train entropix like a champ!
☆19Updated 9 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆68Updated 3 months ago
doomslide / hyperobject
Plotting (entropy, varentropy) for small LMs
☆98Updated 2 months ago
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆190Updated 8 months ago
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆45Updated 4 months ago
xjdr-alt / entropix-local
smol models are fun too
☆93Updated 8 months ago
PrimeIntellect-ai / prime-rl
Decentralized RL Training at Scale
☆400Updated this week
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆62Updated 8 months ago
cloneofsimo / ptx-tutorial-by-aislop
PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)
☆66Updated 4 months ago
LeonGuertler / UnstableBaselines
☆94Updated this week
xjdr-alt / muzero_sketch
☆38Updated last year
doomslide / attention-graph
A graph visualization of attention
☆56Updated 2 months ago
naklecha / llm-inference-optimizations-explained
in this repository, i'm going to implement increasingly complex llm inference optimizations
☆64Updated 2 months ago
samefarrar / entropix_mlx
Modify Entropy Based Sampling to work with Mac Silicon via MLX
☆49Updated 8 months ago
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆82Updated 3 months ago
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆277Updated last week
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆94Updated 2 weeks ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 7 months ago
Figura-Labs-Inc / telegraf_nv_export
Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.
☆62Updated last year
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.
☆71Updated 5 months ago
okarthikb / state-space-models
☆27Updated last year
PrimeIntellect-ai / genesys
☆130Updated 4 months ago
brendanhogan / DeepSeekRL-Extended
Exploring Applications of GRPO
☆244Updated 3 weeks ago
leloykun / modded-nanogpt
NanoGPT (124M) quality in 2.67B tokens
☆28Updated last month