mowa-ai / llm-as-a-service
Simple FastAPI service for the Llama 2 7B chat model
☆20 Updated last year
Alternatives and similar repositories for llm-as-a-service:
Users who are interested in llm-as-a-service are comparing it to the libraries listed below.
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆47 Updated 7 months ago
- ☆13 Updated 7 months ago
- ☆19 Updated 8 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆39 Updated last year
- ☆32 Updated last year
- This is a LlamaIndex project bootstrapped with create-llama to act as a full stack UI to accompany Retrieval-Augmented Generation (RAG) B… ☆29 Updated last year
- Build Agentic workflows with function calling using open LLMs ☆26 Updated 3 weeks ago
- Examples of chat bots using Panel's chat features: Traditional, LLMs, AI Agents, LangChain, OpenAI, etc. ☆117 Updated 4 months ago
- BH hackathon ☆14 Updated last year
- Experimenting with the text-embeddings-inference server on both CPU and GPU ☆18 Updated last year
- GitHub repo for storing LlamaDatasets ☆33 Updated last year
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline ☆36 Updated last year
- ☆11 Updated 10 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 9 months ago
- Baker is an AI-powered app that helps you find recipes and avoid food waste ☆14 Updated 3 months ago
- High-level library for batched embeddings generation, blazingly fast web-based RAG and quantized indexes processing ⚡ ☆66 Updated 5 months ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆23 Updated last year
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera… ☆32 Updated last year
- A repository for creating an ONNX embedding model, with sample code for consuming it ☆31 Updated last year
- Agentic RAG with LangChain, Qdrant and CrewAI ☆58 Updated 11 months ago
- ☆41 Updated 10 months ago
- ☆20 Updated 7 months ago
- ☆45 Updated 2 months ago
- On-device LLM inference using the MediaPipe LLM Inference API. ☆21 Updated last year
- ☆22 Updated 11 months ago
- Simple implementation of a Transformer in Apple's new MLX framework ☆20 Updated 5 months ago
- ☆20 Updated last year
- Uses a Gradio interface to stream coding-related responses from local and cloud-based large language models. Pulls context from GitHub Re… ☆21 Updated last month
- ☆40 Updated 2 weeks ago
- Use a LlamaIndex Agent as a backend service ☆18 Updated 11 months ago