A straightforward method to reduce your LLM inference API costs and token usage.
☆24May 18, 2025Updated last year
Alternatives and similar repositories for save-llm-api-cost
Users that are interested in save-llm-api-cost are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of 12 AI agents evaluation techniques☆43Jul 31, 2025Updated 11 months ago
- ☆15Jun 26, 2026Updated last week
- Implementation of contextual engineering pipeline with LangChain and LangGraph Agents☆92Jul 29, 2025Updated 11 months ago
- A lightweight, alignment-free utility for detecting repeat-containing reads in short-read WGS, WES and RNA-seq data.☆19Jan 16, 2026Updated 5 months ago
- Pipeline for metagenomic community analysis using DNA isolated from virus-like particles☆13Mar 21, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Handling Big Data with Knowledge Graph: A Detailed Guide☆30May 11, 2025Updated last year
- ☆13Apr 29, 2025Updated last year
- ☆11Sep 8, 2025Updated 9 months ago
- This project promulgates an automated end-to-end ML pipeline that trains a biLSTM network for sentiment analysis, experiment tracking, be…☆16Feb 1, 2023Updated 3 years ago
- ☆72Mar 9, 2026Updated 3 months ago
- Dream for [OpenSpec](https://github.com/Fission-AI/OpenSpec)☆82Jun 14, 2026Updated 2 weeks ago
- ☆29Jul 29, 2025Updated 11 months ago
- NixOS module with some useful features for hacked nintendo switch☆21Jul 18, 2024Updated last year
- A NixOS UX subsystem for unified desktop appearance and behavior☆50Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- HAT is a set of tools for calling de novo variants from whole-genome sequencing data.☆24Nov 18, 2025Updated 7 months ago
- Splits fastq files evenly☆24Aug 19, 2020Updated 5 years ago
- Custom KDE Plasma setup blending macOS aesthetics and GNOME colors for a consistent look across Qt and GTK apps.☆17Feb 15, 2025Updated last year
- A list of developer portfolios for your inspiration☆15Nov 1, 2024Updated last year
- Different Types of Prompt Engineering Techniques☆65May 13, 2025Updated last year
- Mixture of Experts from scratch☆14Apr 12, 2024Updated 2 years ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- My personal Wayland Desktop Shell☆48Nov 28, 2025Updated 7 months ago
- ☆10Jun 22, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- FlawlessChips is a C# library that provides gate-level simulation of various 8-bit chips.☆10Mar 15, 2026Updated 3 months ago
- ☆40Jan 14, 2026Updated 5 months ago
- A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers)☆12Mar 20, 2023Updated 3 years ago
- Training framework for Large Behavioral Models☆28Sep 17, 2025Updated 9 months ago
- Minimal TPU implementation with 8x8 systolic array and PyTorch integration☆63Jan 26, 2026Updated 5 months ago
- A Beginner's Guide to Monetizing Your Python AI Chatbot☆17Apr 22, 2025Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- Write health checks as NixOS options to quickly verify if your services are properly running.☆51Aug 16, 2025Updated 10 months ago
- Advice for FOSDEM attendees☆66Feb 5, 2026Updated 4 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- libusb-win32 DLL for Wine☆17Jan 11, 2023Updated 3 years ago
- [KDD 2025] Fine-tuning Multimodal Large Language Models for Product Bundling☆16Sep 20, 2025Updated 9 months ago
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
- Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…☆14Aug 18, 2023Updated 2 years ago
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 8 months ago
- Multi-Agent LLM System for Digital Scam Protection☆15Dec 19, 2024Updated last year
- The official pytorch implementation of our proposed model MISSL (ICDE-24).☆13Dec 8, 2023Updated 2 years ago