isEmmanuelOlowe / llm-cost-estimator
Estimating hardware and cloud costs of LLMs and transformer projects
☆14Updated last year
Alternatives and similar repositories for llm-cost-estimator:
Users that are interested in llm-cost-estimator are comparing it to the libraries listed below
- ☆44Updated 8 months ago
- AI Assistant running within your browser.☆61Updated 3 months ago
- Compression for Foundation Models☆27Updated 2 weeks ago
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆80Updated this week
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆17Updated last year
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs☆111Updated last year
- Lightweight Llama 3 8B Inference Engine in CUDA C☆46Updated 2 weeks ago
- ☆44Updated 7 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆40Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 3 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆26Updated 3 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆88Updated this week
- LLMs as Collaboratively Edited Knowledge Bases☆44Updated last year
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 10 months ago
- Efficient and Scalable Estimation of Tool Representations in Vector Space☆18Updated 5 months ago
- Github repo for Peifeng's internship project☆13Updated last year
- ☆28Updated last year
- Repository for CPU Kernel Generation for LLM Inference☆25Updated last year
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆102Updated 4 months ago
- Tutorial to get started with SkyPilot!☆57Updated 9 months ago
- BH hackathon☆14Updated 11 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆58Updated 6 months ago
- QuIP quantization☆51Updated 11 months ago
- ☆21Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆18Updated last month
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)☆26Updated last year
- ☆37Updated 4 months ago