furiousteabag / vram-calculator
Transformer GPU VRAM estimator
☆45Updated 10 months ago
Alternatives and similar repositories for vram-calculator:
Users that are interested in vram-calculator are comparing it to the libraries listed below
- ☆48Updated last year
- ☆52Updated 9 months ago
- GRDN.AI app for garden optimization☆70Updated 11 months ago
- ☆60Updated last year
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inference☆58Updated last month
- ☆65Updated 8 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆29Updated 4 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆43Updated 11 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 8 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆88Updated this week
- Command-line script for inferencing from models such as MPT-7B-Chat☆101Updated last year
- ☆74Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆28Updated 3 weeks ago
- Command line tool for Deep Infra cloud ML inference service☆28Updated 7 months ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated last year
- Github repo for Peifeng's internship project☆13Updated last year
- First token cutoff sampling inference example☆29Updated last year
- Hallucinations (Confabulations) Document-Based Benchmark for RAG☆65Updated last week
- Training hybrid models for dummies.☆18Updated 2 weeks ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated last year
- Port of Facebook's LLaMA model in C/C++☆20Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆53Updated 11 months ago
- An OpenAI Completions API compatible server for NLP transformers models☆63Updated last year
- look how they massacred my boy☆63Updated 3 months ago
- Jax like function transformation engine but micro, microjax☆30Updated 3 months ago
- ☆112Updated this week
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- A super simple web interface to perform blind tests on LLM outputs.☆27Updated 10 months ago