furiousteabag / vram-calculatorLinks
Transformer GPU VRAM estimator
☆66Updated last year
Alternatives and similar repositories for vram-calculator
Users that are interested in vram-calculator are comparing it to the libraries listed below
Sorting:
- ☆116Updated 9 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆53Updated 2 months ago
- Train, tune, and infer Bamba model☆136Updated 5 months ago
- inference code for mixtral-8x7b-32kseqlen☆102Updated last year
- Pivotal Token Search☆131Updated 4 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆32Updated 10 months ago
- GRDN.AI app for garden optimization☆70Updated last year
- ☆67Updated last year
- The DPAB-α Benchmark☆30Updated 10 months ago
- Self-host LLMs with vLLM and BentoML☆156Updated 3 weeks ago
- ☆102Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated last month
- ☆64Updated 7 months ago
- Alice in Wonderland code base for experiments and raw experiments data☆131Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆145Updated 8 months ago
- Just a bunch of benchmark logs for different LLMs☆118Updated last year
- Train your own SOTA deductive reasoning model☆108Updated 8 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- Simple examples using Argilla tools to build AI☆56Updated last year
- ScalarLM - a unified training and inference stack☆93Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆53Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inference☆62Updated 2 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆98Updated 5 months ago
- Google TPU optimizations for transformers models☆122Updated 9 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆89Updated last year
- look how they massacred my boy☆63Updated last year
- ☆163Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆100Updated last week
- ☆45Updated 2 years ago
- The backend behind the LLM-Perf Leaderboard☆11Updated last year