isEmmanuelOlowe / llm-cost-estimatorLinks
Estimating hardware and cloud costs of LLMs and transformer projects
β16Updated last year
Alternatives and similar repositories for llm-cost-estimator
Users that are interested in llm-cost-estimator are comparing it to the libraries listed below
Sorting:
- Compression for Foundation Modelsβ31Updated 2 months ago
- π· Build compute kernelsβ44Updated this week
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundryβ42Updated last year
- Repository for CPU Kernel Generation for LLM Inferenceβ26Updated last year
- β13Updated 3 weeks ago
- QuIP quantizationβ52Updated last year
- Make triton easierβ47Updated 11 months ago
- π©π€π€ A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)β23Updated 2 years ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models.β20Updated this week
- β39Updated last month
- vLLM adapter for a TGIS-compatible gRPC server.β30Updated this week
- β71Updated 2 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMsβ86Updated this week
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.β17Updated 4 months ago
- β33Updated last month
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama modelsβ35Updated last year
- A collection of reproducible inference engine benchmarksβ31Updated last month
- Tutorial to get started with SkyPilot!β57Updated last year
- Samples of good AI generated CUDA kernelsβ65Updated last week
- β48Updated 10 months ago
- Latent Large Language Modelsβ18Updated 9 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.β17Updated 2 months ago
- Nexusflow function call, tool use, and agent benchmarks.β19Updated 5 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minβ¦β26Updated 6 months ago
- β46Updated last week
- Simple Model Similarities Analysisβ21Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zetaβ13Updated 6 months ago
- Simple GRPO scripts and configurations.β58Updated 4 months ago
- β44Updated last year
- β20Updated last year