A straightforward method to reduce your LLM inference API costs and token usage.
☆22May 18, 2025Updated 10 months ago
Alternatives and similar repositories for save-llm-api-cost
Users that are interested in save-llm-api-cost are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Updated this week
- Implementation of contextual engineering pipeline with LangChain and LangGraph Agents☆88Jul 29, 2025Updated 8 months ago
- Handling Big Data with Knowledge Graph: A Detailed Guide☆30May 11, 2025Updated 11 months ago
- ☆11Sep 8, 2025Updated 7 months ago
- This project promulgates an automated end-to-end ML pipeline that trains a biLSTM network for sentiment analysis, experiment tracking, be…☆16Feb 1, 2023Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A list of developer portfolios for your inspiration☆15Nov 1, 2024Updated last year
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- ☆56Mar 13, 2026Updated 3 weeks ago
- Training framework for Large Behavioral Models☆27Sep 17, 2025Updated 6 months ago
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
- Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…☆14Aug 18, 2023Updated 2 years ago
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization☆25Oct 29, 2025Updated 5 months ago
- Multi-Agent LLM System for Digital Scam Protection☆13Dec 19, 2024Updated last year
- The official pytorch implementation of our proposed model MISSL (ICDE-24).☆13Dec 8, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Recommender system☆25Oct 13, 2020Updated 5 years ago
- Calculate allowed interactions in QED☆10Nov 2, 2022Updated 3 years ago
- A Step-by-Step Implementation of Google Veo 3 Architecture from Scratch☆82Jun 16, 2025Updated 9 months ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year
- The repository of paper Personalized Multimodal Response Generation with Large Language Models☆18Jun 28, 2024Updated last year
- [EMNLP 2024] Enhancing High-order Interaction Awareness in LLM-based Recommender Model.☆13Jan 9, 2025Updated last year
- A comprehensive hands-on project for learning GPU programming with CUDA and HIP, covering fundamental concepts through advanced optimizat…☆35Nov 20, 2025Updated 4 months ago
- G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation☆20Mar 5, 2025Updated last year
- Translate Nvidia Cg shading source code to Open GL Shading source code☆13Apr 23, 2014Updated 11 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- TensorRT☆11Sep 22, 2020Updated 5 years ago
- 📝🤖 WriteAI - Simplify your writing process with AI. Generate emails 📧, articles 📝, essays 📚, & more with ease. Writing is made easy …☆12Feb 21, 2023Updated 3 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings)☆16Dec 30, 2024Updated last year
- A Snowflake SQL parser (WIP)☆11May 31, 2020Updated 5 years ago
- Python FastApi "Circuit Breaker" implementation☆13Mar 14, 2025Updated last year
- [KDD'25] Flow Matching for Collaborative Filtering☆22Sep 6, 2025Updated 7 months ago
- This is a simple user interface for YOLOv8, a popular object detection system. The program allows the user to select a video or image fil…☆11Apr 4, 2023Updated 3 years ago
- Al-Qur'an yang dikemas dalam bentuk ChatBot☆15Dec 1, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This is the official code for the ACL 2025 paper "GRAM: Generative Recommendation via Semantic-aware Multi-granular Late Fusion".☆31Mar 23, 2026Updated 3 weeks ago
- Code for ICML 2025 paper | Joint Localization and Activation Editing for Low-Resource Fine-Tuning☆28Jun 18, 2025Updated 9 months ago
- AI Voice Agents: Exploring the Next Generation of Human-Machine Interaction! 🎙️🤖🎧☆10Aug 30, 2024Updated last year
- [ACM TOMM'2025] "MMHCL: Multi-Modal Hypergraph Contrastive Learning for Recommendation"☆30Aug 13, 2025Updated 8 months ago
- Official source code for AAAI 2025 paper: Augmenting Sequential Recommendation with Balanced Relevance and Diversity☆25Apr 16, 2025Updated 11 months ago
- A super-fast proxy server port scanner一个超级快的端口扫描器☆24Aug 31, 2025Updated 7 months ago
- ☆15Oct 25, 2021Updated 4 years ago