isEmmanuelOlowe / llm-cost-estimator
Estimating hardware and cloud costs of LLMs and transformer projects
☆11Updated last year
Alternatives and similar repositories for llm-cost-estimator:
Users that are interested in llm-cost-estimator are comparing it to the libraries listed below
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆24Updated 2 months ago
- ☆21Updated this week
- ☆43Updated 7 months ago
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆110Updated 2 months ago
- ☆25Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆17Updated 3 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 5 months ago
- ☆25Updated 11 months ago
- Luann allows you to create a LLM agent,which has complete memory module (long-term memory, short-term memory) and knowledge module(Variou…☆18Updated 3 weeks ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆14Updated 5 years ago
- Benchmarking suite for popular AI APIs☆80Updated 2 months ago
- AI Assistant running within your browser.☆57Updated last month
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆80Updated last week
- Compression for Foundation Models☆21Updated this week
- ☆58Updated 8 months ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆22Updated last year
- An all-new OS that orchestrates autonomous agents as workers to execute tasks.☆17Updated 2 months ago
- Github repo for Peifeng's internship project☆13Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆43Updated 11 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆19Updated 4 months ago
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆54Updated 4 months ago
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama models☆36Updated last year
- ☆116Updated 9 months ago
- ☆11Updated 5 months ago
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- vLLM adapter for a TGIS-compatible gRPC server.☆17Updated this week
- Prototype Operator☆24Updated 7 months ago
- Simple CogVLM client script☆14Updated last year
- LLM reads a paper and produce a working prototype☆48Updated last month
- ☆25Updated last year