isEmmanuelOlowe / llm-cost-estimatorLinks
Estimating hardware and cloud costs of LLMs and transformer projects
☆18Updated 2 months ago
Alternatives and similar repositories for llm-cost-estimator
Users that are interested in llm-cost-estimator are comparing it to the libraries listed below
Sorting:
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs☆110Updated last year
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆79Updated 7 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆88Updated this week
- ☆51Updated last year
- Data preparation code for Amber 7B LLM☆91Updated last year
- ☆41Updated 4 months ago
- Cascade Speculative Drafting☆29Updated last year
- Samples of good AI generated CUDA kernels☆89Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 6 months ago
- QuIP quantization☆57Updated last year
- Pre-training code for CrystalCoder 7B LLM☆55Updated last year
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated 2 years ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆64Updated last year
- [ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation☆175Updated last year
- Open Implementations of LLM Analyses☆106Updated 10 months ago
- Repository for CPU Kernel Generation for LLM Inference☆26Updated 2 years ago
- ☆34Updated 3 weeks ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆88Updated this week
- LLM Optimize is a proof-of-concept library for doing LLM (large language model) guided blackbox optimization.☆58Updated 2 years ago
- PB-LLM: Partially Binarized Large Language Models☆153Updated last year
- ☆54Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Train, tune, and infer Bamba model☆131Updated 2 months ago
- ☆54Updated 9 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆29Updated 8 months ago
- new optimizer☆20Updated last year
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆27Updated 4 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 9 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- ☆74Updated 5 months ago