FrugalGPT: better quality and lower cost for LLM applications
☆248Feb 10, 2025Updated last year
Alternatives and similar repositories for FrugalGPT
Users that are interested in FrugalGPT are comparing it to the libraries listed below
Sorting:
- Mixing Language Models with Self-Verification and Meta-Verification☆112Dec 12, 2024Updated last year
- Framework for Cost-Effective Language Model Choice☆16Dec 12, 2023Updated 2 years ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆154Jun 13, 2024Updated last year
- Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication☆21Mar 21, 2024Updated 2 years ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REA…☆31Jun 1, 2024Updated last year
- Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Agent, Audio, Image, Video, Music and 3D…☆37Feb 4, 2025Updated last year
- ☆11Mar 14, 2024Updated 2 years ago
- MLOps Model Factory is an end to end workflow that supports generating multiple models and used for deployment to any target.☆10May 9, 2024Updated last year
- A multimodal live AI assistant designed to enhance the browsing experience using Gemini.☆11Feb 15, 2025Updated last year
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆25Nov 14, 2023Updated 2 years ago
- ☆30Jun 19, 2023Updated 2 years ago
- The code of RouterDC☆71Apr 14, 2025Updated 11 months ago
- Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by Deepmind☆110Feb 29, 2024Updated 2 years ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32May 29, 2024Updated last year
- A Django-based web application that simplifies exam lifecycle management from creation to grading, integrating OCR and AI for an automate…☆10Jul 7, 2024Updated last year
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆389Apr 30, 2024Updated last year
- ☆105Jun 30, 2024Updated last year
- Automatically post images from a subreddit to an instagram account.☆10Feb 24, 2022Updated 4 years ago
- ☆16Jul 23, 2024Updated last year
- To assess the longtext capabilities more comprehensively, we propose Needle-in-a-Haystack PLUS, which shifts the focus from simple fact r…☆13Mar 4, 2024Updated 2 years ago
- ☆641Nov 10, 2025Updated 4 months ago
- Parallel String Graph Construction, Transitive Reduction, and Contig Generation for De Novo Genome Assembly☆16Jun 11, 2024Updated last year
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆368Dec 9, 2023Updated 2 years ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆14Updated this week
- Automatically evaluate your LLMs in Google Colab☆687May 7, 2024Updated last year
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆28Dec 8, 2023Updated 2 years ago
- A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"☆207Mar 12, 2026Updated last week
- Using multiple LLMs for ensemble Forecasting☆16Jan 17, 2024Updated 2 years ago
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆10Sep 4, 2025Updated 6 months ago
- Accepted LLM Papers in NeurIPS 2024☆37Oct 13, 2024Updated last year
- ☆18Mar 9, 2023Updated 3 years ago
- Convert an audio file to a waveform video☆11Nov 10, 2023Updated 2 years ago
- The data processing pipeline for the Koala chatbot language model☆118Apr 6, 2023Updated 2 years ago
- Designed to help lawyers and legal professionals find precedent fast and prepare for case negotiations by simulating trajectories☆10Oct 16, 2024Updated last year
- Official repository of "Pareto Manifold Learning: Tackling multiple tasks via ensembles of single-task models" [ICML 2023]☆23Jan 10, 2025Updated last year
- ☆11Sep 25, 2025Updated 5 months ago
- Train text generation model with JavaScript.☆15Jul 14, 2024Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆13Feb 14, 2024Updated 2 years ago