furiousteabag / vram-calculator
Transformer GPU VRAM estimator
☆66 · Updated last year
Alternatives and similar repositories for vram-calculator
Users who are interested in vram-calculator are comparing it to the libraries listed below.
- ☆116 · Updated 6 months ago
- Pivotal Token Search ☆118 · Updated 3 weeks ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications. ☆51 · Updated this week
- Public reports detailing responses to sets of prompts by Large Language Models. ☆31 · Updated 7 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ☆82 · Updated 2 months ago
- Train, tune, and infer Bamba model ☆130 · Updated 2 months ago
- ☆66 · Updated last year
- ☆51 · Updated last month
- Just a bunch of benchmark logs for different LLMs ☆119 · Updated last year
- inference code for mixtral-8x7b-32kseqlen ☆101 · Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆61 · Updated 2 months ago
- ☆63 · Updated 4 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 · Updated last year
- Simple high-throughput inference library ☆125 · Updated 2 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 · Updated last year
- Simple examples using Argilla tools to build AI ☆53 · Updated 8 months ago
- Benchmarking suite for popular AI APIs ☆87 · Updated 6 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs ☆88 · Updated last year
- Self-host LLMs with vLLM and BentoML ☆139 · Updated last week
- Alice in Wonderland code base for experiments and raw experiments data ☆131 · Updated last month
- Train your own SOTA deductive reasoning model ☆103 · Updated 5 months ago
- 👷 Build compute kernels ☆87 · Updated this week
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆140 · Updated 5 months ago
- ☆157 · Updated last year
- ☆115 · Updated 7 months ago
- GRDN.AI app for garden optimization ☆70 · Updated last year
- ☆199 · Updated last year
- 1.58-bit LLaMa model ☆81 · Updated last year
- Lego for GRPO ☆28 · Updated 2 months ago
- ☆74 · Updated last year