premAI-io / serverless-examples
π End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam
β27Updated last year
Alternatives and similar repositories for serverless-examples:
Users that are interested in serverless-examples are comparing it to the libraries listed below
- BH hackathonβ14Updated 11 months ago
- Using multiple LLMs for ensemble Forecastingβ16Updated last year
- A miniature version of Modalβ20Updated 9 months ago
- β28Updated last year
- Using modal.com to process FineWeb-edu dataβ20Updated 3 weeks ago
- β20Updated last year
- β66Updated 10 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β67Updated 4 months ago
- β48Updated last year
- π Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platformβ37Updated last year
- β24Updated last year
- β49Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 8 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.β25Updated 9 months ago
- Verbosity control for AI agentsβ60Updated 10 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.β33Updated last year
- Modified Stanford-Alpaca Trainer for Training Replit's Code Modelβ40Updated last year
- β111Updated 3 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structureβ46Updated 5 months ago
- Writing Blog Posts with Generative Feedback Loops!β47Updated last year
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaciβ¦β24Updated last year
- Embed anything.β29Updated 10 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Modelsβ21Updated 4 months ago
- β16Updated 10 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectoβ¦β43Updated last year
- Chat Markup Language conversation libraryβ55Updated last year
- Simple AI agents / assistantsβ43Updated 5 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async APIβ45Updated 6 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β64Updated 4 months ago
- Cerule - A Tiny Mighty Vision Modelβ67Updated 6 months ago