ksm26 / Efficiently-Serving-LLMsView external linksLinks
Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and Low Rank Adapters (LoRA), and gain hands-on experience with Predibase’s LoRAX framework inference server.
☆17Apr 12, 2024Updated last year
Alternatives and similar repositories for Efficiently-Serving-LLMs
Users that are interested in Efficiently-Serving-LLMs are comparing it to the libraries listed below
Sorting:
- Deploy, launch and use LLMs on AWS☆16Jun 2, 2023Updated 2 years ago
- A set of tips and tricks to assist in the Certified Kubernetes Application Developer exam by Cloud Native Computing Foundation.☆93Dec 20, 2022Updated 3 years ago
- Tutorial for LLM developers about engine design, service deployment, evaluation/benchmark, etc. Provide a C/S style optimized LLM inferen…☆19Sep 5, 2023Updated 2 years ago
- Deploy SageMaker models with Terraform☆23Feb 14, 2018Updated 8 years ago
- Super Mario is a legendary game we all cherish! In this project, we will deploy Super Mario on Amazon EKS (Elastic Kubernetes Service) us…☆11Feb 3, 2026Updated last week
- This repository will take you through creating a FastAPI StableDiffusion app (including Dockerfile) all the way to adding a new feature u…☆38Nov 9, 2022Updated 3 years ago
- This is the official repository for the paper "Words That Unite The World: A Unified Framework for Deciphering Global Central Bank Commun…☆17Oct 19, 2025Updated 3 months ago
- Multiprocessing in python☆10Aug 20, 2021Updated 4 years ago
- Repository for React Fundamentals classroom demonstration contacts app☆11Nov 19, 2024Updated last year
- ☆16May 26, 2025Updated 8 months ago
- Integrating with Spotify API and extracting Data. Deploying code on AWS Lambda for Data Extraction. Adding trigger to run the extraction …☆11Jul 5, 2023Updated 2 years ago
- It gives you a step by step approach to predict binary data using linear regression.☆10Feb 28, 2021Updated 4 years ago
- ☆11Apr 8, 2024Updated last year
- ☆12Jun 17, 2023Updated 2 years ago
- Google Cloud Platform (GCP) CLI and utils☆14May 6, 2023Updated 2 years ago
- ☆10Jul 1, 2020Updated 5 years ago
- Repository has all the information & resouces to setup and configure monitoring & logging on EKS cluster☆13Jul 8, 2024Updated last year
- A repository containing link to some my Kaggle starter Notebooks☆11Jun 1, 2020Updated 5 years ago
- Code showing how to use a model based on the ML model base class.☆10Sep 30, 2022Updated 3 years ago
- collection of work pertinent to integrating Roberta with fastai:☆11Nov 11, 2020Updated 5 years ago
- ☆11Oct 19, 2018Updated 7 years ago
- Machine Learning for Internet of Things☆12Jul 24, 2019Updated 6 years ago
- This project scrapes text from Telugu books(Novels)☆10Aug 3, 2021Updated 4 years ago
- oneNeuron Pytroch basics course docs plus code☆10Mar 14, 2022Updated 3 years ago
- This is a terraform script for coast optimization using lambda. So, this script can set up a cron(schedule) to start and stop ec2 servers…☆10Feb 26, 2022Updated 3 years ago
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆12Sep 16, 2024Updated last year
- Machine Learning Model and Deployment for Classification of Mango Varieties☆10Dec 22, 2022Updated 3 years ago
- Python scripts for AWS using boto3 SDK☆10May 31, 2018Updated 7 years ago
- ☆13Apr 12, 2023Updated 2 years ago
- This repository contains publicly available speech and text data in Luganda.☆12Sep 4, 2020Updated 5 years ago
- Python script to automate AMI backup of AWS EC2 instances☆13May 31, 2014Updated 11 years ago
- Exercises for the CERN Openlab GPU lecture☆12Jul 22, 2025Updated 6 months ago
- LangSmithGo is a Golang-based client library designed to interface with the LangSmith API for tracking and monitoring large language mode…☆16Feb 13, 2025Updated last year
- ☆21Apr 28, 2025Updated 9 months ago
- Simple demo for Databricks!☆14Sep 11, 2023Updated 2 years ago
- Netflix Clone React App☆10Aug 20, 2023Updated 2 years ago
- Machine Learning Projects on IOT sensor data☆10May 6, 2019Updated 6 years ago
- ☆11Aug 12, 2024Updated last year
- ☆18Oct 16, 2024Updated last year