furiousteabag / vram-calculatorLinks
Transformer GPU VRAM estimator
☆65Updated last year
Alternatives and similar repositories for vram-calculator
Users that are interested in vram-calculator are comparing it to the libraries listed below
Sorting:
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆52Updated last year
- GRDN.AI app for garden optimization☆70Updated last year
- Pivotal Token Search☆104Updated last month
- Train, tune, and infer Bamba model☆127Updated 3 weeks ago
- inference code for mixtral-8x7b-32kseqlen☆100Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆62Updated 10 months ago
- LLM family chart☆51Updated last year
- Extend the original llama.cpp repo to support redpajama model.☆118Updated 9 months ago
- Your buddy in the (L)LM space.☆64Updated 9 months ago
- Chat Markup Language conversation library☆55Updated last year
- ☆66Updated last year
- First token cutoff sampling inference example☆30Updated last year
- Embedding models from Jina AI☆60Updated last year
- ☆47Updated last year
- Testing LLM reasoning abilities with lineage relationship quizzes.☆28Updated 3 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆87Updated 3 weeks ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆137Updated 11 months ago
- Because it's there.☆16Updated 9 months ago
- An OpenAI Completions API compatible server for NLP transformers models☆65Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- ☆39Updated 2 years ago
- Command line tool for Deep Infra cloud ML inference service☆31Updated last year
- ☆116Updated 4 months ago
- Inference Llama/Llama2/Llama3 Modes in NumPy☆21Updated last year
- ☆23Updated 4 months ago
- ☆73Updated last year
- Benchmarking tool for assessing LLM models' performance across different hardwares☆17Updated last year
- Access the Cohere Command R family of models☆37Updated 2 months ago
- ☆37Updated this week