FareedKhan-dev / save-llm-api-costLinks
A straightforward method to reduce your LLM inference API costs and token usage.
☆21Updated 8 months ago
Alternatives and similar repositories for save-llm-api-cost
Users that are interested in save-llm-api-cost are comparing it to the libraries listed below
Sorting:
- ☆54Updated 3 weeks ago
- Implementation of 12 AI agents evaluation techniques☆35Updated 6 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated last year
- Implementation of contextual engineering pipeline with LangChain and LangGraph Agents☆83Updated 6 months ago
- An end-to-end pipeline to optimize and host LLM for 100K parallel queries☆36Updated 7 months ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆34Updated last year
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆35Updated 3 months ago
- ☆14Updated last year
- ☆67Updated 10 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 8 months ago
- ☆39Updated last year
- Encountering 14 different Naive RAG fails and using KG to solve it☆20Updated 2 months ago
- ☆16Updated 2 years ago
- Creating the DeepSeek V3 model from scratch☆24Updated 10 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Updated last year
- ☆63Updated last year
- ☆107Updated 10 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Updated last year
- [ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts☆52Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- Handling Big Data with Knowledge Graph: A Detailed Guide☆29Updated 9 months ago
- Score LLM pretraining data with classifiers☆55Updated 2 years ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 10 months ago
- Create informative READMEs effortlessly using AI-driven templates with the README Creator powered by Language Model (LLM). Simplify docum…☆13Updated 2 years ago
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT-MoE (~85M params). Fast, creative text generation …☆15Updated 2 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆38Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆28Updated last year
- Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"☆22Updated 4 months ago
- Measuring RAG solutions throughput and latency☆19Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Updated 2 years ago