Cost reduction tools and techniques for LLM-based systems
☆87 · Jul 6, 2025 · Updated 9 months ago
Alternatives and similar repositories for awesome-cheap-llms
Users interested in awesome-cheap-llms are comparing it to the libraries listed below.
- Loan Default Prediction using PySpark, with jobs scheduled by Apache Airflow and Integration with Spark using Apache Livy ☆22 · Dec 26, 2020 · Updated 5 years ago
- EnriCo: Enriched Representation and Globally Constrained Inference for Entity and Relation Extraction ☆26 · May 22, 2024 · Updated last year
- ☆21 · May 23, 2024 · Updated last year
- Use `outlines` generators with Haystack. ☆15 · Updated this week
- ☆30 · Updated this week
- Repository for the Demo of using DVC with PyCaret & MLOps (DVC Office Hours - 20th Jan, 2022) ☆11 · Jan 20, 2022 · Updated 4 years ago
- This repo contains my projects from the Udacity Data Engineering Nano degree ☆13 · Apr 26, 2023 · Updated 2 years ago
- ☆13 · Oct 28, 2025 · Updated 5 months ago
- Uses Milvus and OpenAI's API to perform question answering over documents with a chat interface ☆20 · Aug 14, 2023 · Updated 2 years ago
- ☆10 · Mar 19, 2024 · Updated 2 years ago
- Fine tuned LLM examples running on Kubernetes ☆11 · Oct 1, 2023 · Updated 2 years ago
- ☆25 · Jul 14, 2021 · Updated 4 years ago
- ☆12 · Oct 15, 2021 · Updated 4 years ago
- Project on NLP and model deployment ☆10 · Feb 1, 2023 · Updated 3 years ago
- ☆15 · Oct 19, 2023 · Updated 2 years ago
- Example of how to build machine learning training workflow on AWS by Prefect ☆12 · Nov 2, 2022 · Updated 3 years ago
- [npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction ☆12 · May 1, 2024 · Updated last year
- ☆15 · Dec 1, 2023 · Updated 2 years ago
- ☆13 · Oct 13, 2023 · Updated 2 years ago
- LLM Zoomcamp - a free online course about real-life applications of LLMs. In 10 weeks you will learn how to build an AI system that answe… ☆4,808 · Dec 1, 2025 · Updated 4 months ago
- Signature Verification ☆13 · Mar 2, 2022 · Updated 4 years ago
- Demo of Machine Learning Prediction Model API with Django REST API Framework ☆10 · Dec 22, 2019 · Updated 6 years ago
- Terraform-Based Bedrock RAG Deployment ☆10 · Sep 17, 2024 · Updated last year
- Tayra is a sophisticated call center analytics platform designed to systematically evaluate and score call center audio interactions. By … ☆14 · Dec 19, 2025 · Updated 4 months ago
- ☆14 · Apr 22, 2024 · Updated last year
- Created a Flask API🚀 which can detect toxicity in comment(text)💭 using NLP-BERT🤖. Following MLOps lifecycle🔁 to deploy ML system in p… ☆14 · Jan 23, 2023 · Updated 3 years ago
- Tools to help work with bulk data when using Microsoft Purview ☆14 · Mar 26, 2026 · Updated 3 weeks ago
- Instant voice cloning by MyShell. Join our Discord community https://discord.gg/myshell and select the Developer role upon joining to gai… ☆10 · Dec 4, 2025 · Updated 4 months ago
- ☆13 · Oct 6, 2023 · Updated 2 years ago
- ☆13 · Jan 7, 2022 · Updated 4 years ago
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry. ☆17 · Jan 14, 2024 · Updated 2 years ago
- ☆31 · Apr 11, 2025 · Updated last year
- Do you want to LEARN NEW STUFF for FREE? Don't worry, with the power of web-scraping and automation, this script will find the necessary … ☆19 · Feb 29, 2024 · Updated 2 years ago
- A set of examples illustrating some possible use cases for NannyML ☆21 · Oct 20, 2023 · Updated 2 years ago
- The Data Explorer and Machine Learning App ☆14 · Feb 22, 2026 · Updated last month
- ☆16 · Sep 9, 2023 · Updated 2 years ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop ☆85 · Apr 25, 2025 · Updated 11 months ago
- ☆14 · Mar 30, 2024 · Updated 2 years ago
- Machine Learning Engineering on AWS, published by Packt ☆73 · Mar 2, 2026 · Updated last month