premAI-io / serverless-examplesLinks

🚀 End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam

☆28

Alternatives and similar repositories for serverless-examples

Users that are interested in serverless-examples are comparing it to the libraries listed below

Sorting:

tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66Updated 11 months ago
huggingface / discord-bots
☆50Updated last year
adithya-s-k / YoloGemma
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…
☆82Updated last year
iulia-b10 / multilingual-embedding-models
☆20Updated last year
4dh / GRDN
GRDN.AI app for garden optimization
☆70Updated last year
multimodalart / grog
Gradio UI for a Cog API
☆69Updated last year
camenduru / MoE-LLaVA-jupyter
☆16Updated last year
QuixiAI / kraken
☆66Updated last year
weaviate / structured-rag
Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models
☆111Updated 4 months ago
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20Updated 4 months ago
nateraw / modal-examples
Apps that run on modal.com
☆12Updated last month
teknium1 / ShareGPT-Builder
☆116Updated 7 months ago
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 9 months ago
aniketmaurya / Agents
Build Agentic workflows with function calling using open LLMs
☆28Updated this week
Datura-ai / cortex.t
☆63Updated 7 months ago
jquesnelle / literAI
Generate visual podcasts about novels using open source models
☆25Updated 2 years ago
julien-blanchon / arxflix
Arxflix turns your boring Arxiv research paper into a captivating video.
☆52Updated this week
NousResearch / finetuning-subnet
☆121Updated last year
neuralmagic / examples
Notebooks using the Neural Magic libraries 📓
☆40Updated last year
katanaml / llm-rag-invoice-cpu
Data extraction with LLM on CPU
☆68Updated last year
diicellman / dynamite-dogs
BH hackathon
☆14Updated last year
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year
qnguyen3 / hermes-llava
☆54Updated last year
BBischof / yapping
Verbosity control for AI agents
☆64Updated last year
JeezAI / DSPy_matchmaking
A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…
☆59Updated last year
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 8 months ago
pacman100 / peft-codegen-25
☆23Updated 2 years ago
Technoculture / personal-graph
Simple Graph Memory for AI applications
☆89Updated 2 months ago
charlesfrye / minimodal
A miniature version of Modal
☆20Updated last year
jacoblee93 / oss-model-extraction-evals
☆31Updated last year