premAI-io / serverless-examplesLinks
🚀 End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beam
☆28Updated last year
Alternatives and similar repositories for serverless-examples
Users that are interested in serverless-examples are comparing it to the libraries listed below
Sorting:
- Cerule - A Tiny Mighty Vision Model☆68Updated 2 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- GRDN.AI app for garden optimization☆69Updated 2 months ago
- BH hackathon☆14Updated last year
- ☆52Updated 2 years ago
- ☆68Updated last year
- Build Agentic workflows with function calling using open LLMs☆28Updated 2 weeks ago
- ☆119Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Updated 2 months ago
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manual☆22Updated 2 years ago
- ☆20Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆46Updated last year
- ☆40Updated 8 months ago
- Verbosity control for AI agents☆66Updated last year
- A miniature version of Modal☆23Updated last year
- ☆64Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- Apps that run on modal.com☆12Updated 4 months ago
- alternative way to calculating self attention☆18Updated last year
- Notebooks using the Neural Magic libraries 📓☆39Updated last year
- inference code for mixtral-8x7b-32kseqlen☆105Updated 2 years ago
- Gradio UI for a Cog API☆71Updated last year
- Explore new advancements like ChatGPT’s function calling capability, and build a conversational agent using a new syntax called LangChain…☆15Updated 2 years ago
- Using modal.com to process FineWeb-edu data☆20Updated 9 months ago
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.☆77Updated 2 years ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated 2 weeks ago
- Simple AI agents / assistants☆51Updated last year
- Simple Graph Memory for AI applications☆90Updated 8 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year