metaskills / llamafile-on-lambdaLinks
Serverless AI Inference with Gemma 2 using Mozilla's llamafile on AWS Lambda
☆11Updated last year
Alternatives and similar repositories for llamafile-on-lambda
Users that are interested in llamafile-on-lambda are comparing it to the libraries listed below
Sorting:
- Deploy llama.cpp compatible Generative AI LLMs on AWS Lambda!☆177Updated last year
- A Lightweight Library for AI Observability☆255Updated 11 months ago
- Turn a fresh Linux installation into a fully configured, sleek, and modern on device AI development system by running a single command.☆102Updated 8 months ago
- An implemention of GraphRAG using open source small LLMs☆14Updated last year
- ♾️ Helix is a private GenAI stack for building AI agents with declarative pipelines, knowledge (RAG), API bindings, and first-class testi…☆712Updated this week
- Advanced document extraction and chunking techniques for retrieval augmented generation that is aware of the layout of documents. Increas…☆114Updated 2 months ago
- LLM-powered document chat using Amazon Bedrock and AWS Serverless☆294Updated last week
- Materials for the LLM Evals Workshop from Weights & BIases☆14Updated 11 months ago
- Deep Research for your internal data☆351Updated 8 months ago
- ☆38Updated last year
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation☆108Updated last year
- ☆14Updated last year
- ☆297Updated 10 months ago
- ☆181Updated 2 years ago
- ☆345Updated last year
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssembly☆274Updated 9 months ago
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment☆31Updated 2 years ago
- ☆90Updated 2 years ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆401Updated 2 weeks ago
- A lightweight express.js server implementing OpenAI’s Responses API, built on top of Chat Completions, powered by Hugging Face Inference …☆223Updated 6 months ago
- Constrain LLM output☆113Updated last year
- ☆62Updated 2 years ago
- Build Generative AI applications with Langchain on AWS☆182Updated 2 years ago
- ☆89Updated 9 months ago
- A non-official CLI for Llama Index Parser☆216Updated last year
- ☆74Updated last year
- A Python SDK for optimizing prompts for Amazon Nova Models.☆54Updated 3 weeks ago
- A fully in-browser privacy solution to make Conversational AI privacy-friendly☆234Updated last year
- A framework for generative software.☆115Updated 7 months ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline☆41Updated last year