Serverless LLM Inference: Deploy DeepSeek R1 & LLaMA Models on AWS Lambda with Ultra-Fast Cold Starts
☆13Feb 3, 2026Updated 3 months ago
Alternatives and similar repositories for sample-serverless-llama-server
Users that are interested in sample-serverless-llama-server are comparing it to the libraries listed below.
- Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech☆14Jan 3, 2023Updated 3 years ago
- Build a Streamlit app with LangChain and Amazon Bedrock - Use ElastiCache Serverless Redis for chat history, deploy to EKS and manage per…☆14Jan 12, 2024Updated 2 years ago
- A collection of data analysis projects done using PySpark via Jupyter notebooks.☆10Oct 8, 2022Updated 3 years ago
- A cross-platform Python CPU benchmarking tool and leaderboard.☆15Nov 11, 2023Updated 2 years ago
- Turn Trello into a CMS to power all your websites and apps.☆10May 12, 2018Updated 8 years ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22May 9, 2025Updated last year
- ☆15Jun 26, 2024Updated last year
- Code of our paper "Attribute Graph Neural Networks for Strict Cold Start Recommendation" accepted by TKDE 2020.☆15Dec 11, 2020Updated 5 years ago
- Nano implementation of TOML using Markty.☆16Jun 22, 2022Updated 3 years ago
- Example microservice using https://github.com/zeit/micro to scrape a Platzi profile☆12Nov 11, 2017Updated 8 years ago
- ☆19Feb 18, 2025Updated last year
- Procedural data generators suite for synthetic pretraining and formal reasoning☆38Updated this week
- 🤖 A robotic pick-and-place solution for the Flipkart GRID 5.0 Finals. Features real-time object detection (YOLO), inverse kinematics, an…☆10Jun 23, 2025Updated 10 months ago
- Binaris Function as a Service CLI☆20Nov 24, 2019Updated 6 years ago
- ☆11Jan 9, 2025Updated last year
- WebRTC AEC3 fully working demo using Qt6 Audio☆14Apr 27, 2026Updated 3 weeks ago
- ☆24Apr 15, 2024Updated 2 years ago
- ☆29Feb 3, 2026Updated 3 months ago
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain only☆33Jul 19, 2024Updated last year
- ☆13Mar 23, 2023Updated 3 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆23Mar 15, 2024Updated 2 years ago
- Iterate fast on your RAG pipelines☆24Jun 21, 2025Updated 11 months ago
- Convert a nodejs .cpuprofile to a renderable flame graph node☆12Jan 3, 2019Updated 7 years ago
- Whisper inference with TensorRT-LLM☆25Sep 22, 2023Updated 2 years ago
- Verify blockchain data presented at popular websites using Light Client technology☆11Nov 24, 2024Updated last year
- Official Repo for the 30DaysOfFLCode Challenge Initiative☆76Aug 28, 2025Updated 8 months ago
- Effective frame sampling for ML applications.☆27Aug 30, 2025Updated 8 months ago
- COMP9417 machine learning and data mining notes and work☆20Sep 14, 2019Updated 6 years ago
- A mod for the Amplitude Studios Humankind game.☆15Apr 28, 2025Updated last year
- A fluent API for generating Java byte code☆14Apr 4, 2013Updated 13 years ago
- ☆30Jul 22, 2024Updated last year
- ☆12Jul 20, 2024Updated last year
- Repository that builds a PWA with Vue.js and does SSR on Lambda☆19Aug 25, 2017Updated 8 years ago
- Markdown-it plugin that adds Font Awesome icons support☆21Apr 16, 2023Updated 3 years ago
- LaTeX template for the TC (final course paper) at IF Goiano, Ceres campus☆13May 13, 2019Updated 7 years ago
- Animated introduction screen slide package for Flutter☆13Nov 7, 2024Updated last year
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- 💬 Comics and 📖 Ebook web library written in rust 🦀🚀 (Almost usable)☆16May 5, 2026Updated 2 weeks ago
- Rust wrapper for the browser FileSystem API☆14Feb 14, 2025Updated last year