premAI-io / serverless-examplesLinks
π End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam
β28Updated last year
Alternatives and similar repositories for serverless-examples
Users that are interested in serverless-examples are comparing it to the libraries listed below
Sorting:
- Cerule - A Tiny Mighty Vision Modelβ67Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β84Updated last year
- Using modal.com to process FineWeb-edu dataβ20Updated 6 months ago
- β116Updated 10 months ago
- β67Updated last year
- GRDN.AI app for garden optimizationβ70Updated last year
- β50Updated 2 years ago
- Build Agentic workflows with function calling using open LLMsβ28Updated 2 weeks ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Modelsβ111Updated 6 months ago
- Verbosity control for AI agentsβ65Updated last year
- β20Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β58Updated 2 weeks ago
- BH hackathonβ13Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crewβ¦β59Updated last year
- Gradio UI for a Cog APIβ69Updated last year
- Set of scripts to finetune LLMsβ38Updated last year
- Generate visual podcasts about novels using open source modelsβ25Updated 2 years ago
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.β76Updated 2 years ago
- Simple AI agents / assistantsβ48Updated last year
- RAG example using DSPy, Gradio, FastAPIβ85Updated last year
- β120Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).β45Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β67Updated 11 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optunaβ58Updated this week
- β16Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β50Updated last year
- Unofficial implementation and experiments related to Set-of-Mark (SoM) ποΈβ87Updated 2 years ago
- β63Updated 10 months ago
- Finetune any model on HF in less than 30 secondsβ55Updated this week
- DiffusionWithAutoscalerβ29Updated last year