Deploy and Scale LLM-based applications
☆26Jun 15, 2023Updated 3 years ago
Alternatives and similar repositories for llms-in-prod-workshop-2023
Users that are interested in llms-in-prod-workshop-2023 are comparing it to the libraries listed below.
- Python C++ Code Manager☆15Sep 29, 2024Updated last year
- A suite of hands-on training materials that shows how to scale CV, NLP, and time-series forecasting workloads with Ray.☆456Feb 13, 2024Updated 2 years ago
- LLM query engine to retrieve augmented responses from json files.☆15Oct 12, 2023Updated 2 years ago
- Implementation of vDNN++; an improvement over vDNN☆18Dec 7, 2018Updated 7 years ago
- Sentence Embedding as a Service☆15Jun 30, 2025Updated 11 months ago
- My first RAG application☆10Jul 29, 2024Updated last year
- A python package that provides a custom streamlit connection to query data from weaviate, the AI native vector database☆59Aug 1, 2024Updated last year
- A simple demo showing how to use the Ideogram inpainting model on Replicate using Node.js.☆16Oct 24, 2024Updated last year
- Fast model deployment on AWS EC2☆14Feb 25, 2024Updated 2 years ago
- App to visualize events happening for the August 21st Solar Eclipse in the US☆11Aug 1, 2017Updated 8 years ago
- Super Mario is a legendary game we all cherish! In this project, we will deploy Super Mario on Amazon EKS (Elastic Kubernetes Service) us…☆11Feb 3, 2026Updated 4 months ago
- Large Language Model (LLM) Serving Paper and Resource List☆28Apr 16, 2026Updated 2 months ago
- ChatGPT over SMS using Twilio Programmable Messaging, OpenAI API, Flask☆13Jan 22, 2023Updated 3 years ago
- Template repo for kickstarting recipes for regression use case☆56Dec 10, 2024Updated last year
- ☆12Dec 8, 2020Updated 5 years ago
- Edit an input image with an input prompt using Google's new Gemini 2.5 Flash (Nano Banana) model, Cloudflare Workers, and Cloudflare R2!☆26Aug 27, 2025Updated 9 months ago
- A tool that democratizes and standardizes access to Web APIs.☆14Mar 2, 2023Updated 3 years ago
- Simply, faster, sentence-transformers☆144Aug 27, 2024Updated last year
- A project that accompanies my talk, ReactiveCocoa made Simple with Swift☆27Sep 3, 2014Updated 11 years ago
- Cloudflare Worker to analyze SEO of input website using Cloudflare browser rendering and Llama-3.2 hosted on Workers AI☆12Jan 22, 2025Updated last year
- ☆11Feb 21, 2018Updated 8 years ago
- Google Calendar API v3. Haskell implementation☆12Sep 27, 2014Updated 11 years ago
- Online model serving with Fraud Detection model trained with XGBoost on IEEE-CIS dataset☆18Jun 26, 2023Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Training v1.1 benchmark.☆23May 18, 2023Updated 3 years ago
- Fast model deployment on AWS Sagemaker☆16Feb 25, 2024Updated 2 years ago
- An open-source, self-hosted, full-code alternative to Customer.io. Write your user engagement logic in pure TypeScript and SQL (Drizzle O…☆14Nov 18, 2024Updated last year
- ☆15Aug 18, 2025Updated 10 months ago
- Send SMS text messages using the Amazon Echo (Alexa) and the Twilio REST API.☆12May 30, 2016Updated 10 years ago
- Code for my talk on CoreAnimation Archives☆16Jun 3, 2019Updated 7 years ago
- A lightweight, self-improving research system powered by Cloudflare Workers and OpenAI. Uses Durable Objects to recursively generate, ver…☆25Feb 1, 2025Updated last year
- Examples showing use of NGC containers and models within Amazon SageMaker☆17Oct 4, 2022Updated 3 years ago
- Update your Twitter status right from Chrome's Omnibox (URL bar).☆20Sep 7, 2018Updated 7 years ago
- Small extensions of the Bellman-Ford routines in NetworkX, primarily for convenience☆13May 7, 2018Updated 8 years ago
- memAry @ University of Texas at Austin☆21Apr 16, 2024Updated 2 years ago
- A Baltimore Sun analysis of the condition of bridges in the state of Maryland, with a focus on bridges in the Baltimore area.☆15Oct 24, 2018Updated 7 years ago
- A roadmap for aspiring AI engineers☆15Oct 14, 2023Updated 2 years ago
- A simple example to showcase machine learning model deployment with an API☆10Mar 7, 2022Updated 4 years ago
- This project scrapes text from Telugu books (novels)☆10Aug 3, 2021Updated 4 years ago
- ☆10Aug 12, 2024Updated last year