Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and Low Rank Adapters (LoRA), and gain hands-on experience with Predibase’s LoRAX framework inference server.
☆18Apr 12, 2024Updated last year
Alternatives and similar repositories for Efficiently-Serving-LLMs
Users that are interested in Efficiently-Serving-LLMs are comparing it to the libraries listed below
Sorting:
- Deploy, launch and use LLMs on AWS☆16Jun 2, 2023Updated 2 years ago
- A set of tips and tricks to assist in the Certified Kubernetes Application Developer exam by Cloud Native Computing Foundation.☆93Dec 20, 2022Updated 3 years ago
- Deploy SageMaker models with Terraform☆23Feb 14, 2018Updated 8 years ago
- Super Mario is a legendary game we all cherish! In this project, we will deploy Super Mario on Amazon EKS (Elastic Kubernetes Service) us…☆11Feb 3, 2026Updated last month
- This repository will take you through creating a FastAPI StableDiffusion app (including Dockerfile) all the way to adding a new feature u…☆38Nov 9, 2022Updated 3 years ago
- Repository containing Anki Flashcards & source code to hopefully learn/revise any language☆11Jan 30, 2026Updated last month
- Repository for React Fundamentals classroom demonstration contacts app☆11Nov 19, 2024Updated last year
- Multiprocessing in python☆10Aug 20, 2021Updated 4 years ago
- ☆16May 26, 2025Updated 9 months ago
- This is the official repository for the paper "Words That Unite The World: A Unified Framework for Deciphering Global Central Bank Commun…☆17Oct 19, 2025Updated 4 months ago
- This is code depository for my upcoming session. Will update details post the session☆40Jan 29, 2023Updated 3 years ago
- Integrating with Spotify API and extracting Data. Deploying code on AWS Lambda for Data Extraction. Adding trigger to run the extraction …☆11Jul 5, 2023Updated 2 years ago
- It gives you a step by step approach to predict binary data using linear regression.☆11Feb 28, 2021Updated 5 years ago
- ☆11Apr 8, 2024Updated last year
- ☆10Aug 18, 2021Updated 4 years ago
- ☆12Jun 17, 2023Updated 2 years ago
- Google Cloud Platform (GCP) CLI and utils☆14May 6, 2023Updated 2 years ago
- Free tool to copy CSVs from https://chartink.com/☆15Sep 7, 2025Updated 6 months ago
- A simple example to showcase machine learning model deployment with an API☆10Mar 7, 2022Updated 4 years ago
- A LLM-friendly framework for translating dynamical equations to gymnasium-compatible RL environments.☆33Oct 16, 2025Updated 4 months ago
- COMET for African languages☆10Jan 24, 2025Updated last year
- For my IBM Data Science Professional certificate capstone project in early 2020, I used pandas, the FourSquare API, Folium, and other Pyt…☆13Dec 31, 2020Updated 5 years ago
- Ideas on how to quickly learn to build command-line tools☆11Feb 26, 2022Updated 4 years ago
- Python script to automate AMI backup of AWS EC2 instances☆13May 31, 2014Updated 11 years ago
- Code showing how to use a model based on the ML model base class.☆10Sep 30, 2022Updated 3 years ago
- Saturn Cloud starter code for LLM Zoomcamp☆11Jul 1, 2024Updated last year
- ☆23Apr 28, 2025Updated 10 months ago
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆12Sep 16, 2024Updated last year
- Machine Learning for Internet of Things☆12Jul 24, 2019Updated 6 years ago
- Python scripts for AWS using boto3 SDK☆10May 31, 2018Updated 7 years ago
- ☆10Aug 31, 2023Updated 2 years ago
- A repository containing link to some my Kaggle starter Notebooks☆11Jun 1, 2020Updated 5 years ago
- Repository has all the information & resouces to setup and configure monitoring & logging on EKS cluster☆13Jul 8, 2024Updated last year
- ☆11Oct 19, 2018Updated 7 years ago
- This repository contains publicly available speech and text data in Luganda.☆12Sep 4, 2020Updated 5 years ago
- ☆13Apr 12, 2023Updated 2 years ago
- ☆10Jul 1, 2020Updated 5 years ago
- oneNeuron Pytroch basics course docs plus code☆10Mar 14, 2022Updated 3 years ago
- Build a Streamlit app with LangChain and Amazon Bedrock - Use ElastiCache Serverless Redis for chat history, deploy to EKS and manage per…☆14Jan 12, 2024Updated 2 years ago