premAI-io / serverless-examplesLinks
π End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam
β28Updated last year
Alternatives and similar repositories for serverless-examples
Users that are interested in serverless-examples are comparing it to the libraries listed below
Sorting:
- Cerule - A Tiny Mighty Vision Modelβ67Updated 11 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β84Updated last year
- BH hackathonβ14Updated last year
- Gradio UI for a Cog APIβ69Updated last year
- GRDN.AI app for garden optimizationβ70Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β67Updated 10 months ago
- β63Updated 8 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β59Updated last month
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crewβ¦β59Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ111Updated 4 months ago
- β67Updated last year
- Arxflix turns your boring Arxiv research paper into a captivating video.β52Updated this week
- A high performance batching router optimises max throughput for text inference workloadβ16Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Modelsβ22Updated 9 months ago
- β55Updated last month
- β116Updated 8 months ago
- β20Updated last year
- Using modal.com to process FineWeb-edu dataβ20Updated 4 months ago
- Simple program to manually caption your images (or any other file types) so you can use them for AI trainingβ37Updated 2 years ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).β45Updated last year
- Notebooks using the Neural Magic libraries πβ39Updated last year
- A miniature version of Modalβ20Updated last year
- β121Updated last year
- β16Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optunaβ55Updated 6 months ago
- Quickly and securely turn any Linux box into a build and deployment assistantβ24Updated 11 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorchβ103Updated 8 months ago
- alternative way to calculating self attentionβ18Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated last year
- Apps that run on modal.comβ12Updated last month