premAI-io / serverless-examples
π End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam
β27Updated 10 months ago
Alternatives and similar repositories for serverless-examples:
Users that are interested in serverless-examples are comparing it to the libraries listed below
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Modelsβ21Updated 2 months ago
- Build Agentic workflows with function calling using open LLMsβ26Updated 2 weeks ago
- Cerule - A Tiny Mighty Vision Modelβ67Updated 5 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β59Updated 3 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β48Updated 7 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β64Updated 3 months ago
- Easy to use, High Performant Knowledge Distillation for LLMsβ46Updated last month
- β48Updated last year
- BH hackathonβ14Updated 10 months ago
- Using modal.com to process FineWeb-edu dataβ20Updated 2 months ago
- Using multiple LLMs for ensemble Forecastingβ16Updated last year
- Simple AI agents / assistantsβ41Updated 4 months ago
- β24Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β58Updated last month
- Set of scripts to finetune LLMsβ36Updated 10 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β79Updated 8 months ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Modelβ40Updated last year
- π Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platformβ37Updated last year
- β111Updated last month
- alternative way to calculating self attentionβ18Updated 8 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ100Updated 2 months ago
- Routing on Random Forest (RoRF)β112Updated 4 months ago
- β32Updated last month
- β20Updated last year
- Ongoing research training transformer models at scaleβ34Updated last year
- A miniature version of Modalβ19Updated 8 months ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.β45Updated 11 months ago
- Streamlit app for recommending eval functions using prompt diffsβ27Updated last year