Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and Low Rank Adapters (LoRA), and gain hands-on experience with Predibase’s LoRAX framework inference server.
☆19Apr 12, 2024Updated 2 years ago
Alternatives and similar repositories for Efficiently-Serving-LLMs
Users that are interested in Efficiently-Serving-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Human Evaluation Benchmark for Text Simplification☆10Sep 6, 2018Updated 7 years ago
- Encountering 14 different Naive RAG fails and using KG to solve it☆24Dec 4, 2025Updated 4 months ago
- A set of tips and tricks to assist in the Certified Kubernetes Application Developer exam by Cloud Native Computing Foundation.☆93Dec 20, 2022Updated 3 years ago
- Deploy SageMaker models with Terraform☆23Feb 14, 2018Updated 8 years ago
- Deploy, launch and use LLMs on AWS☆16Jun 2, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Klexikon: A German Dataset for Joint Summarization and Simplification☆17Oct 5, 2022Updated 3 years ago
- Simplification Automatic evaluation Measure through Semantic Annotation☆17Mar 11, 2019Updated 7 years ago
- CNN for Text Classification in Pytorch☆19Nov 27, 2017Updated 8 years ago
- ☆10Aug 18, 2021Updated 4 years ago
- Здесь собирается каталог ссылок на полезные языковые ресурсы башкирского языка☆16Jul 25, 2024Updated last year
- ☆25May 9, 2022Updated 3 years ago
- Alignment and annotation for comparable documents.☆22Oct 16, 2018Updated 7 years ago
- oneNeuron Pytroch basics course docs plus code☆10Mar 14, 2022Updated 4 years ago
- COMET for African languages☆11Jan 24, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of the G-CORE graph query language on Spark☆15Aug 25, 2021Updated 4 years ago
- ☆14Oct 11, 2023Updated 2 years ago
- Super Mario is a legendary game we all cherish! In this project, we will deploy Super Mario on Amazon EKS (Elastic Kubernetes Service) us…☆11Feb 3, 2026Updated 2 months ago
- Nebula docker image for development☆16Apr 1, 2026Updated 2 weeks ago
- A Python package that helps you create paths relative to the project root☆15Dec 27, 2022Updated 3 years ago
- Repository for React Fundamentals classroom demonstration contacts app☆11Nov 19, 2024Updated last year
- ☆14Jul 28, 2024Updated last year
- Community detection in complex networks using hybrid quantum annealing on Amazon Braket☆13Jul 6, 2023Updated 2 years ago
- A SSD-based graph processing engine for billion-node graphs☆12Feb 1, 2015Updated 11 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code showing how to use a model based on the ML model base class.☆10Sep 30, 2022Updated 3 years ago
- Official code of our work, VCSR: Mutable CSR Graph Format Using Vertex-Centric Packed Memory Array [CCGrid 2022].☆13Jun 30, 2022Updated 3 years ago
- This repository contains publicly available speech and text data in Luganda.☆12Sep 4, 2020Updated 5 years ago
- For my IBM Data Science Professional certificate capstone project in early 2020, I used pandas, the FourSquare API, Folium, and other Pyt…☆13Dec 31, 2020Updated 5 years ago
- You’ll explore new advancements like ChatGPT’s function calling capability, and build a conversational agent using a new syntax called La…☆16Oct 28, 2023Updated 2 years ago
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆11Sep 16, 2024Updated last year
- Russian morphological tagset converters library.☆43Oct 4, 2019Updated 6 years ago
- Exercises for the CERN Openlab GPU lecture☆12Jul 22, 2025Updated 8 months ago
- AI-driving Vehicle Simulation using Machine Learning(CNN) | PyTorch implementation of "End to End Learning for Self-Driving Cars" (arXiv:…☆21Jan 18, 2020Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆16Oct 22, 2023Updated 2 years ago
- Packed Memory Array☆17May 14, 2014Updated 11 years ago
- The specification of the LDBC Financial Benchmark☆19Jan 9, 2026Updated 3 months ago
- Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book a…☆10Oct 21, 2020Updated 5 years ago
- Docker best practices☆24Oct 7, 2022Updated 3 years ago
- Nebula Graph Client API in Rust.☆20Dec 4, 2023Updated 2 years ago
- Extracts parallel corpora from the 2 raw texts in different languages.☆37Nov 4, 2022Updated 3 years ago