premAI-io / serverless-examples
End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam
★28 · Updated last year
Alternatives and similar repositories for serverless-examples
Users that are interested in serverless-examples are comparing it to the libraries listed below
- Cerule - A Tiny Mighty Vision Model ★68 · Updated last month
- ★118 · Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ★85 · Updated last year
- ★51 · Updated 2 years ago
- BH hackathon ★14 · Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority. ★59 · Updated 2 months ago
- Build Agentic workflows with function calling using open LLMs ★28 · Updated last month
- A miniature version of Modal ★22 · Updated last year
- ★64 · Updated last year
- ★20 · Updated last year
- ★68 · Updated last year
- ★28 · Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ★51 · Updated last year
- inference code for mixtral-8x7b-32kseqlen ★104 · Updated 2 years ago
- A high performance batching router optimises max throughput for text inference workload ★16 · Updated 2 years ago
- ★122 · Updated last year
- GRDN.AI app for garden optimization ★69 · Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ★69 · Updated last month
- Using modal.com to process FineWeb-edu data ★20 · Updated 8 months ago
- Simple AI agents / assistants ★51 · Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ★59 · Updated last year
- alternative way to calculating self attention ★18 · Updated last year
- Verbosity control for AI agents ★64 · Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ★119 · Updated last year
- Gradio UI for a Cog API ★71 · Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ★43 · Updated last year
- Apps that run on modal.com ★12 · Updated 3 months ago
- ★31 · Updated 11 months ago
- ★55 · Updated 3 months ago
- A new benchmark for measuring LLM's capability to detect bugs in large codebase. ★32 · Updated last year