premAI-io / serverless-examples
End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam
☆27 Updated last year
Alternatives and similar repositories for serverless-examples:
Users who are interested in serverless-examples are comparing it to the repositories listed below.
- A miniature version of Modal ☆20 Updated 10 months ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model ☆40 Updated last year
- Using modal.com to process FineWeb-edu data ☆20 Updated 2 weeks ago
- Build Agentic workflows with function calling using open LLMs ☆26 Updated 2 weeks ago
- ☆50 Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆39 Updated 2 months ago
- ☆28 Updated last year
- ☆41 Updated 2 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 9 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆66 Updated 5 months ago
- Data extraction with LLM on CPU ☆68 Updated last year
- ☆12 Updated last year
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i… ☆18 Updated 2 months ago
- ☆20 Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆21 Updated 5 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated last year
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 7 months ago
- An alternative way of calculating self-attention ☆18 Updated 11 months ago
- ☆112 Updated 4 months ago
- ☆38 Updated last year
- DiffusionWithAutoscaler ☆29 Updated last year
- ☆1 Updated 9 months ago
- ☆48 Updated last year
- Unstructured Data Connectors for Haystack 2.0 ☆16 Updated last year
- BH hackathon ☆14 Updated last year
- A high-performance batching router that optimises max throughput for text inference workloads ☆16 Updated last year
- Embed anything. ☆29 Updated 11 months ago
- A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ ☆64 Updated last year
- ☆19 Updated 6 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆59 Updated last year