FareedKhan-dev / save-llm-api-costLinks
A straightforward method to reduce your LLM inference API costs and token usage.
☆21Updated 8 months ago
Alternatives and similar repositories for save-llm-api-cost
Users that are interested in save-llm-api-cost are comparing it to the libraries listed below
Sorting:
- Implementation of 12 AI agents evaluation techniques☆35Updated 6 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated last year
- An end-to-end pipeline to optimize and host LLM for 100K parallel queries☆36Updated 7 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Updated last year
- ☆54Updated 3 weeks ago
- A Step-by-Step Implementation of RAPTOR based RAG implementation☆36Updated 5 months ago
- Multi-Agent LLM System for Digital Scam Protection☆12Updated last year
- Encountering 14 different Naive RAG fails and using KG to solve it☆20Updated 2 months ago
- Implementation of contextual engineering pipeline with LangChain and LangGraph Agents☆80Updated 6 months ago
- AI Multi-agent system for real-time, adaptive supply chain coordination and optimization leveraging responsive AI clusters.☆35Updated last year
- An agent to generate stunning images :)☆23Updated 8 months ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆34Updated last year
- ☆14Updated last year
- ☆16Updated 2 years ago
- Multi-Agent AI App from Scratch in python without any depedency of framework☆15Updated last year
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Updated 2 years ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 10 months ago
- ☆24Updated last year
- Hands-on hub to learn techniques to optimize and serve AI models to production the most optimal way.☆14Updated 5 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 8 months ago
- Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…☆14Updated 2 years ago
- Creating the DeepSeek V3 model from scratch☆24Updated 10 months ago
- [ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts☆52Updated last year
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆20Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- Pandas-LLM☆46Updated 2 years ago
- ☆55Updated 5 months ago
- Composition of Multimodal Language Models From Scratch☆15Updated last year
- Finetune any model on HF in less than 30 seconds☆56Updated last week
- Improving langchain knowledge graphs using baml☆42Updated 6 months ago