United-Compute / gpu-benchmarkLinks
Benchmark your GPU with ease
☆26Updated 4 months ago
Alternatives and similar repositories for gpu-benchmark
Users that are interested in gpu-benchmark are comparing it to the libraries listed below
Sorting:
- ☆62Updated 3 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆145Updated 7 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated 8 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆42Updated last month
- ☆67Updated last year
- look how they massacred my boy☆63Updated 11 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆90Updated 4 months ago
- GPTQ and efficient search for GGUF☆50Updated 3 weeks ago
- An unsupervised model merging algorithm for Transformers-based language models.☆106Updated last year
- ☆60Updated 3 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated 11 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆80Updated last week
- 1.58-bit LLaMa model☆83Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 8 months ago
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆154Updated 11 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Updated last year
- ☆40Updated last year
- RWKV-7: Surpassing GPT☆97Updated 10 months ago
- entropix style sampling + GUI☆27Updated 11 months ago
- Lego for GRPO☆29Updated 4 months ago
- Inference of Mamba models in pure C☆191Updated last year
- Video+code lecture on building nanoGPT from scratch☆68Updated last year
- A powerful 130-million-parameter model trained from scratch as part of a truly open-source stack, including a custom tokenizer, dataset, …☆66Updated last month
- ☆136Updated last year
- PyTorch implementation of models from the Zamba2 series.☆185Updated 8 months ago
- Modeling code for a BitNet b1.58 Llama-style model.☆25Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- Samples of good AI generated CUDA kernels☆91Updated 4 months ago
- 5X faster 60% less memory QLoRA finetuning☆21Updated last year
- GRDN.AI app for garden optimization☆70Updated last year