metaskills / llamafile-on-lambdaLinks
Serverless AI Inference with Gemma 2 using Mozilla's llamafile on AWS Lambda
☆11Updated last year
Alternatives and similar repositories for llamafile-on-lambda
Users that are interested in llamafile-on-lambda are comparing it to the libraries listed below
Sorting:
- Deploy llama.cpp compatible Generative AI LLMs on AWS Lambda!☆177Updated last year
- A lightweight express.js server implementing OpenAI’s Responses API, built on top of Chat Completions, powered by Hugging Face Inference …☆223Updated 6 months ago
- A Lightweight Library for AI Observability☆255Updated 11 months ago
- Materials for the LLM Evals Workshop from Weights & BIases☆14Updated 11 months ago
- Demonstration of agentic capabilities in TypeScript☆110Updated last year
- Routing on Random Forest (RoRF)☆239Updated last year
- ☆181Updated 2 years ago
- ☆271Updated 9 months ago
- Pinecone + Vercel RAG application, showcasing a comparison between chat with no context and using a Pinecone index for context☆78Updated last week
- Pinecone AWS Reference Architecture☆115Updated last year
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆265Updated 2 weeks ago
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models☆76Updated 10 months ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…☆248Updated last year
- Constrain LLM output☆113Updated last year
- An implemention of GraphRAG using open source small LLMs☆14Updated last year
- ☆345Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆154Updated last year
- Python toolkit for building graph-enhanced GenAI applications☆358Updated this week
- An external version of a pull request for langchain.☆27Updated this week
- ☆90Updated 2 years ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline☆41Updated last year
- ☆38Updated last year
- Tutorial on how to properly send intermediate LlamaIndex events to vercel ai sdk via server-sent events during RAG.☆199Updated last year
- Turn a fresh Linux installation into a fully configured, sleek, and modern on device AI development system by running a single command.☆102Updated 8 months ago
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆108Updated last year
- 😎 Sagentic.ai Agent Framework - Sagentic.ai is a unified platform for building, running and scaling autonomous agents.☆74Updated last month
- Cloud-native, AI-powered, document processing pipelines on AWS.☆186Updated 2 weeks ago
- Repo to experiment with Graph RAG strategies using Kùzu☆64Updated 4 months ago
- A generative AI-powered framework for testing virtual agents.☆336Updated last month
- ☆54Updated last year