stanford-futuredata / FrugalGPTView external linksLinks
FrugalGPT: better quality and lower cost for LLM applications
☆245Feb 10, 2025Updated last year
Alternatives and similar repositories for FrugalGPT
Users that are interested in FrugalGPT are comparing it to the libraries listed below
Sorting:
- Mixing Language Models with Self-Verification and Meta-Verification☆112Dec 12, 2024Updated last year
- Framework for Cost-Effective Language Model Choice☆17Dec 12, 2023Updated 2 years ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆154Jun 13, 2024Updated last year
- Efficient LLM query routing via multi-sampling. BEST-Route selects both model and number of responses based on query difficulty, cutting …☆43Aug 6, 2025Updated 6 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Mar 22, 2024Updated last year
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REA…☆29Jun 1, 2024Updated last year
- Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication☆21Mar 21, 2024Updated last year
- Designed to help lawyers and legal professionals find precedent fast and prepare for case negotiations by simulating trajectories☆10Oct 16, 2024Updated last year
- ☆30Jun 19, 2023Updated 2 years ago
- A Python library for creating adversarial splits☆14Jul 24, 2022Updated 3 years ago
- ☆11Mar 14, 2024Updated last year
- RFCs for standardcompletions.org☆25Jun 11, 2025Updated 8 months ago
- ☆11Apr 21, 2025Updated 9 months ago
- Train text generation model with JavaScript.☆15Jul 14, 2024Updated last year
- 🔎 A Prodigy plugin for evaluating spaCy pipelines☆13Mar 26, 2024Updated last year
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆388Apr 30, 2024Updated last year
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆14Updated this week
- ☆14Sep 20, 2024Updated last year
- ☆32Jul 8, 2024Updated last year
- Resources and notebooks to accompany the Duplicate Detection using GenAI paper☆16Jul 1, 2024Updated last year
- ☆15Jun 10, 2024Updated last year
- ☆14Mar 31, 2024Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32May 29, 2024Updated last year
- ☆16Dec 17, 2020Updated 5 years ago
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆369Dec 9, 2023Updated 2 years ago
- 💙 Unstructured Data Connectors for Haystack 2.0☆17Sep 21, 2023Updated 2 years ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- [preprint] PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration☆40Jan 7, 2026Updated last month
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- ☆16Jul 23, 2024Updated last year
- ☆17Jan 25, 2021Updated 5 years ago
- AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.☆15Dec 22, 2025Updated last month
- Automatically evaluate your LLMs in Google Colab☆685May 7, 2024Updated last year
- Implementation for our TOIS paper --- Attentive Long Short-Term Preference Modeling for Personalized Product Search.☆19Feb 14, 2020Updated 6 years ago
- ☆19Aug 7, 2024Updated last year
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Dec 30, 2023Updated 2 years ago
- ☆641Nov 10, 2025Updated 3 months ago
- Deploy your HPC Cluster on AWS in 20min. with just 1-Click.☆54Oct 29, 2025Updated 3 months ago
- This repository contains a toy implementation of a basic RAQA system.☆20Jun 3, 2024Updated last year