FareedKhan-dev / save-llm-api-costLinks
A straightforward method to reduce your LLM inference API costs and token usage.
☆18Updated 7 months ago
Alternatives and similar repositories for save-llm-api-cost
Users that are interested in save-llm-api-cost are comparing it to the libraries listed below
Sorting:
- Implementation of contextual engineering pipeline with LangChain and LangGraph Agents☆72Updated 4 months ago
- ☆52Updated last year
- Composition of Multimodal Language Models From Scratch☆15Updated last year
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated 11 months ago
- Improving langchain knowledge graphs using baml☆36Updated 4 months ago
- AI Multi-agent system for real-time, adaptive supply chain coordination and optimization leveraging responsive AI clusters.☆34Updated last year
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Updated last year
- Implementation of 12 AI agents evaluation techniques☆29Updated 4 months ago
- ☆67Updated 8 months ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆33Updated last year
- ☆63Updated last year
- An agent to generate stunning images :)☆23Updated 6 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆17Updated 2 months ago
- Building LLMs from scratch following the book from S. Raschka☆32Updated 8 months ago
- Finetune any model on HF in less than 30 seconds☆56Updated 2 months ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆35Updated last month
- ☆40Updated last year
- Multi-Agent LLM System for Digital Scam Protection☆11Updated last year
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆21Updated last year
- UQ: Assessing Language Models on Unsolved Questions☆29Updated 3 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 7 months ago
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆70Updated last year
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT-MoE (~85M params). Fast, creative text generation …☆15Updated 2 weeks ago
- Design Patterns for Multi Agents Frameworks Like Autogen, Langraph, Taskweaver, Crewai,etc☆69Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆24Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated 11 months ago
- ☆55Updated 3 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆35Updated last year
- Synthetic Data Generation using LLM via Argilla, Distilabel, ChatGPT, etc.☆30Updated last year
- Building LLaMA 4 MoE from Scratch☆68Updated 8 months ago