premAI-io / serverless-examples
End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam
☆28 Updated last year
Alternatives and similar repositories for serverless-examples
Users who are interested in serverless-examples are comparing it to the libraries listed below.
- Cerule - A Tiny Mighty Vision Model ☆66 Updated 10 months ago
- BH hackathon ☆14 Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆81 Updated last year
- ☆50 Updated last year
- Using modal.com to process FineWeb-edu data ☆20 Updated 3 months ago
- ☆20 Updated last year
- ☆63 Updated 6 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆22 Updated 7 months ago
- Gradio UI for a Cog API ☆69 Updated last year
- Alternative way of calculating self-attention ☆18 Updated last year
- Build Agentic workflows with function calling using open LLMs ☆28 Updated last week
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆47 Updated 10 months ago
- ☆115 Updated 6 months ago
- GRDN.AI app for garden optimization ☆70 Updated last year
- ☆22 Updated last year
- Apps that run on modal.com ☆12 Updated last week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆66 Updated 8 months ago
- ☆66 Updated last year
- ☆10 Updated 2 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆108 Updated 3 months ago
- ☆29 Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead) ☆21 Updated 9 months ago
- Tutorial for DSPy ☆23 Updated last year
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll… ☆16 Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆55 Updated 5 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆64 Updated 9 months ago
- Arxflix turns your boring Arxiv research paper into a captivating video. ☆52 Updated last month
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆59 Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆49 Updated 9 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority. ☆59 Updated last week