Learn the ins and outs of efficiently serving Large Language Models (LLMs). Dive into optimization techniques, including KV caching and Low Rank Adapters (LoRA), and gain hands-on experience with Predibase’s LoRAX framework inference server.
☆19Apr 12, 2024Updated 2 years ago
Alternatives and similar repositories for Efficiently-Serving-LLMs
Users that are interested in Efficiently-Serving-LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Human Evaluation Benchmark for Text Simplification☆10Sep 6, 2018Updated 7 years ago
- A monolingual parallel corpus for sentence simplification☆11Jul 4, 2016Updated 9 years ago
- A set of tips and tricks to assist in the Certified Kubernetes Application Developer exam by Cloud Native Computing Foundation.☆93May 15, 2026Updated 2 weeks ago
- A Parallel Russian-Simple Russian Dataset☆17Mar 30, 2023Updated 3 years ago
- Deploy SageMaker models with Terraform☆23Feb 14, 2018Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Deploy, launch and use LLMs on AWS☆16Jun 2, 2023Updated 2 years ago
- AI Software Bill of Materials for EU AI Act☆12Jan 18, 2024Updated 2 years ago
- Здесь собирается каталог ссылок на полезные языковые ресурсы башкирского языка☆17Jul 25, 2024Updated last year
- Repo for the simplified text alignment tools.☆21Dec 4, 2020Updated 5 years ago
- Concurrent inverse Bloom filter.☆15Feb 3, 2015Updated 11 years ago
- Alignment and annotation for comparable documents.☆22Oct 16, 2018Updated 7 years ago
- ☆10Aug 24, 2023Updated 2 years ago
- Machine Learning for Mathematics Faculty (HSE) 2018☆18Jan 23, 2022Updated 4 years ago
- oneNeuron Pytroch basics course docs plus code☆10Mar 14, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- COMET for African languages☆11Jan 24, 2025Updated last year
- ☆14Oct 11, 2023Updated 2 years ago
- A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning☆16Jul 20, 2020Updated 5 years ago
- LLM Evals Leaderboard☆49Nov 21, 2023Updated 2 years ago
- Repository for React Fundamentals classroom demonstration contacts app☆11Nov 19, 2024Updated last year
- ☆14Jul 28, 2024Updated last year
- Code showing how to use a model based on the ML model base class.☆10Sep 30, 2022Updated 3 years ago
- Artifacts of EVT ASPLOS'24☆30Mar 6, 2024Updated 2 years ago
- A Java library for quantum programming using Quil.☆16Jul 23, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Resources to learn data processing with GPT and other language models☆21Dec 10, 2024Updated last year
- This repository contains publicly available speech and text data in Luganda.☆12Sep 4, 2020Updated 5 years ago
- For my IBM Data Science Professional certificate capstone project in early 2020, I used pandas, the FourSquare API, Folium, and other Pyt…☆13Dec 31, 2020Updated 5 years ago
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆12Sep 16, 2024Updated last year
- This repository will take you through creating a FastAPI StableDiffusion app (including Dockerfile) all the way to adding a new feature u…☆38Nov 9, 2022Updated 3 years ago
- Russian morphological tagset converters library.☆43Oct 4, 2019Updated 6 years ago
- AI-driving Vehicle Simulation using Machine Learning(CNN) | PyTorch implementation of "End to End Learning for Self-Driving Cars" (arXiv:…☆21Jan 18, 2020Updated 6 years ago
- Generate a dataset to finetune a LLM to generate Cypher code from questions given in natural language (English).☆15May 24, 2024Updated 2 years ago
- ☆16Oct 22, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆40Updated this week
- Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book a…☆10Oct 21, 2020Updated 5 years ago
- Extracts parallel corpora from the 2 raw texts in different languages.☆37Nov 4, 2022Updated 3 years ago
- Analysis using reduced NanoAOD files created from CMS open data producing a high statistics di-muon spectrum☆15Sep 5, 2023Updated 2 years ago
- AI assisted Quantum technologies☆16Nov 27, 2022Updated 3 years ago
- Experimentation on google's gemma model☆16Mar 6, 2024Updated 2 years ago
- This project scrapes text from Telugu books(Novels)☆10Aug 3, 2021Updated 4 years ago