99991 / pyggufLinks
GGUF parser in Python
☆27Updated 9 months ago
Alternatives and similar repositories for pygguf
Users that are interested in pygguf are comparing it to the libraries listed below
Sorting:
- QuIP quantization☆52Updated last year
- Some random tools for working with the GGUF file format☆26Updated last year
- ☆46Updated last week
- ☆74Updated 6 months ago
- A safetensors extension to efficiently store sparse quantized tensors on disk☆117Updated this week
- Simple high-throughput inference library☆115Updated 3 weeks ago
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama models☆35Updated last year
- Experiments with BitNet inference on CPU☆55Updated last year
- Python bindings for ggml☆141Updated 9 months ago
- Gpu benchmark☆63Updated 4 months ago
- A fast RWKV Tokenizer written in Rust☆45Updated 2 months ago
- RWKV-7: Surpassing GPT☆88Updated 6 months ago
- Repository for CPU Kernel Generation for LLM Inference☆26Updated last year
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆42Updated last year
- ☆21Updated 3 months ago
- ☆24Updated 8 months ago
- Example of applying CUDA graphs to LLaMA-v2☆12Updated last year
- QLoRA with Enhanced Multi GPU Support☆37Updated last year
- Explore training for quantized models☆18Updated last week
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- ☆34Updated 11 months ago
- ☆71Updated 2 months ago
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆154Updated 7 months ago
- ☆130Updated 2 months ago
- ☆17Updated last year
- FlexAttention w/ FlashAttention3 Support☆26Updated 8 months ago
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆72Updated 4 months ago
- ☆53Updated last year
- ☆119Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆11Updated last year