Deploy llama.cpp compatible Generative AI LLMs on AWS Lambda!
☆177Apr 16, 2024Updated last year
Alternatives and similar repositories for llama-on-lambda
Users that are interested in llama-on-lambda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Oct 24, 2023Updated 2 years ago
- ☆17Aug 18, 2023Updated 2 years ago
- ☆18Oct 9, 2023Updated 2 years ago
- ☆18May 4, 2025Updated 10 months ago
- Intelligent Document Processing with AWS AI/ML, published by Packt☆12Mar 2, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Retrieval Augmented Generation, but no servers involved. Backed by S3☆12Nov 3, 2023Updated 2 years ago
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆15May 3, 2021Updated 4 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆13Nov 27, 2023Updated 2 years ago
- ☆11Jul 9, 2023Updated 2 years ago
- ☆14Jul 1, 2019Updated 6 years ago
- ☆20Jun 15, 2023Updated 2 years ago
- Example showing how to run a LLM fully inside an AWS Lambda Function☆23Jan 13, 2024Updated 2 years ago
- ☆13Feb 16, 2023Updated 3 years ago
- ☆56Jun 26, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆15Mar 28, 2025Updated 11 months ago
- ☆12Apr 1, 2025Updated 11 months ago
- Large Language Model Hosting Container☆91Mar 11, 2026Updated 2 weeks ago
- Fuzzy Process Mining in R☆22Dec 31, 2021Updated 4 years ago
- ☆29Sep 4, 2023Updated 2 years ago
- ☆19Mar 24, 2024Updated 2 years ago
- Visual Inspection AI Edge solution infrastructure provisioning scripts☆16Nov 12, 2024Updated last year
- This project demonstrates how to parse emails, process them using OpenAI's GPT-3.5, and load the data into a Weaviate vector database for…☆22May 3, 2023Updated 2 years ago
- The SQL generation library you already know how to use.☆22Jul 20, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Pretty notification box☆16May 31, 2022Updated 3 years ago
- Eventbrite clone app built with Node Express back-end and React Redux on the front-end.☆10Oct 9, 2018Updated 7 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Video decoding functions for JavaScript to decode raw h264 or VP8/VP9 frames using openh264 and libvpx☆12Feb 11, 2022Updated 4 years ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆39Apr 11, 2024Updated last year
- Reference Architecture to automate the use of S3 Express One Zone as a caching layer for S3 Regional Buckets.☆14Apr 14, 2025Updated 11 months ago
- Finetune Your Local LLM☆18Sep 23, 2023Updated 2 years ago
- FM-Leaderboard-er allows you to create leaderboard to find the best LLM/prompt for your own business use case based on your data, task, p…☆19Oct 31, 2024Updated last year
- Python SDK for Galileo's NLP and CV Studio.☆17Mar 17, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆23May 4, 2023Updated 2 years ago
- ☆10Jul 27, 2016Updated 9 years ago
- Generative AI in realtime with Confluent Cloud.☆28Apr 16, 2024Updated last year
- Automatic Labelling for Omnivore on AWS☆14Jan 17, 2024Updated 2 years ago
- A small Python library for NLP Interchange Format (NIF) for NER(D) systems☆19Feb 9, 2023Updated 3 years ago
- ☆12Oct 8, 2021Updated 4 years ago
- Capstone project for mlops zoomcamp☆11Sep 12, 2022Updated 3 years ago