Deploy and Scale LLM-based applications
☆26Jun 15, 2023Updated 2 years ago
Alternatives and similar repositories for llms-in-prod-workshop-2023
Users that are interested in llms-in-prod-workshop-2023 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fine-tuning LLMs on Flyte and Union Cloud☆30Dec 1, 2023Updated 2 years ago
- Memory footprint reduction for transformer models☆11Jan 24, 2023Updated 3 years ago
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Mar 20, 2024Updated 2 years ago
- Sentence Embedding as a Service☆15Jun 30, 2025Updated 9 months ago
- A python package that provides a custom streamlit connection to query data from weaviate, the AI native vector database☆59Aug 1, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A simple demo showing how to use the Ideogram inpainting model on Replicate using Node.js.☆14Oct 24, 2024Updated last year
- ☆22Jul 11, 2023Updated 2 years ago
- App to visualize events happening for the August 21st Solar Eclipse in the US☆11Aug 1, 2017Updated 8 years ago
- Large Language Model (LLM) Serving Paper and Resource List☆28May 18, 2025Updated 11 months ago
- ChatGPT over SMS using Twilio Programmable Messaging, OpenAI API, Flask☆12Jan 22, 2023Updated 3 years ago
- ☆12Dec 8, 2020Updated 5 years ago
- Chrome Extension that submits to Last.fm music playing on SoundCloud —☆11Jan 16, 2017Updated 9 years ago
- A tool that democratizes and standardizes access to Web APIs.☆14Mar 2, 2023Updated 3 years ago
- Simply, faster, sentence-transformers☆144Aug 27, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Accompanying code for the Starting Up talk☆10Apr 24, 2025Updated 11 months ago
- Cloudflare Worker to analyze SEO of input website using Cloudflare browser rendering and Llama-3.2 hosted on Workers AI☆12Jan 22, 2025Updated last year
- ☆11Feb 21, 2018Updated 8 years ago
- Google Calendar API v3. Haskell implementation☆12Sep 27, 2014Updated 11 years ago
- Streamlit apps on Cloud Run with Identity-Aware Proxy (IAP).☆10Mar 5, 2022Updated 4 years ago
- Online model serving with Fraud Detection model trained with XGBoost on IEEE-CIS dataset☆18Jun 26, 2023Updated 2 years ago
- Multimodal Chat with Gemini API☆47Dec 25, 2023Updated 2 years ago
- Fine-Tuning Falcon-7B, LLAMA 2 with QLoRA to create an advanced AI model with a profound understanding of the Indian legal context.☆96Feb 4, 2024Updated 2 years ago
- Stateful polling app built with Hono, Cloudflare Pages, Cloudflare Workers AI, and Hono for the NBA Finals☆19Jun 6, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This Cloudflare Worker emails me an analysis/summary of the top 10 Hacker News stories every hour using Cloudflare Email, Workers AI, AI …☆22Feb 28, 2025Updated last year
- GPT-4 & Claude for Figma & FigJam☆20Jul 25, 2023Updated 2 years ago
- An open-source, self-hosted, full-code alternative to Customer.io. Write your user engagement logic in pure TypeScript and SQL (Drizzle O…☆14Nov 18, 2024Updated last year
- ☆15Aug 18, 2025Updated 8 months ago
- Send sms text messages using the Amazon Echo (Alexa) and the Twilio REST Api.☆12May 30, 2016Updated 9 years ago
- Code for my talk on CoreAnimation Archives☆16Jun 3, 2019Updated 6 years ago
- A lightweight, self-improving research system powered by Cloudflare Workers and OpenAI. Uses Durable Objects to recursively generate, ver…☆25Feb 1, 2025Updated last year
- Examples showing use of NGC containers and models withing Amazon SageMaker☆17Oct 4, 2022Updated 3 years ago
- Update your Twitter status right from Chrome's Omnibox (URL bar).☆20Sep 7, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 2 years ago
- Small extensions of the Bellman-Ford routines in NetworkX, primarily for convenience☆13May 7, 2018Updated 7 years ago
- Node.js cwiid asynchronous native bindings☆31Apr 5, 2012Updated 14 years ago
- A Baltimore Sun analysis of the condition of bridges in the state of Maryland, with a focus on bridges in the Baltimore area.☆15Oct 24, 2018Updated 7 years ago
- A roadmap for aspiring AI engineers☆14Oct 14, 2023Updated 2 years ago
- This project scrapes text from Telugu books(Novels)☆10Aug 3, 2021Updated 4 years ago
- A simple example to showcase machine learning model deployment with an API☆10Mar 7, 2022Updated 4 years ago