RahulSChand / gpu_poor

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
1,152Updated 2 weeks ago

Related projects

Alternatives and complementary repositories for gpu_poor