FareedKhan-dev / save-llm-api-costLinks

A straightforward method to reduce your LLM inference API costs and token usage.

☆16

Alternatives and similar repositories for save-llm-api-cost

Users that are interested in save-llm-api-cost are comparing it to the libraries listed below

Sorting:

padas-lab-de / ir-rag-sigir24-persona-rag
☆47Updated 10 months ago
jmanhype / Golden-Retriever
A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…
☆33Updated 11 months ago
ashishpatel26 / ai-tutor-rag-system
This is a repository for the course "From Beginner to LLM Developer" by Towards AI.
☆11Updated 6 months ago
miralab-ai / autoreason
☆40Updated 7 months ago
ianhohoho / auto-hyde
🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…
☆32Updated last year
cxcscmu / RAGViz
Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
☆86Updated 6 months ago
camenduru / MiniGPT-v2-colab
☆29Updated last year
du-nlp-lab / MLR-Copilot
☆66Updated 3 months ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆63Updated 11 months ago
githubpradeep / notebooks
☆54Updated 5 months ago
attashe / ModifiedBeamSampler
Modified Beam Search with periodical restart
☆12Updated 10 months ago
run-llama / image-generation-agent
An agent to generate stunning images :)
☆21Updated 2 months ago
ArturTanona / grpo_unsloth_docker
☆57Updated 5 months ago
ulab-uiuc / ToMAP
Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"
☆14Updated last month
kyegomez / Tiktokx
Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…
☆13Updated last year
shivamsanju / ragswift
🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform
☆38Updated last year
AI-ANK / c3-python-nostream
Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…
☆23Updated last year
ali-bahrainian / RAG_best_practices
☆94Updated 4 months ago
aniket-work / AI_Powered_Dev_Search_Engine
AI_Powered_Dev_Search_Engine
☆12Updated last year
cityzen95 / LLM_from_scratch
Building LLMs from scratch following the book from S. Raschka
☆30Updated 3 months ago
HITsz-TMG / KaLM-Embedding
Code for KaLM-Embedding models
☆86Updated 3 weeks ago
yai333 / Text-to-SQL-GRPO-Fine-tuning-Pipeline
This repository contains a pipeline for fine-tuning Large Language Models (LLMs) for Text-to-SQL conversion using General Reward Proximal…
☆31Updated 3 months ago
phunterlau / paper_without_code
LLM reads a paper and produce a working prototype
☆58Updated 3 months ago
mrmaheshrajput / productionizing-llms
Code Repository for Blog - How to Productionize Large Language Models (LLMs)
☆11Updated last year
Sayandip170900 / CUDA-Challenge
100 Days of GPU Challenge
☆21Updated last month
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆43Updated last year
v-prgmr / mergekit
Tools for merging pretrained large language models.
☆19Updated last year
google-deepmind / llms_can_learn_rules
☆57Updated 7 months ago