run-llama / llama-api
☆25Updated last year
Alternatives and similar repositories for llama-api:
Users that are interested in llama-api are comparing it to the libraries listed below
- a simple create-llama template using llama-index v0.10 and integrated with Ollama☆10Updated 9 months ago
- Opinionated Langchain setup with Qdrant vector store and Kong gateway☆31Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆22Updated last year
- A Python program leveraging OpenAI's language models to generate, analyze, and select the best answer to a given question.☆46Updated last year
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search☆40Updated last year
- ☆44Updated last year
- Experimenting text-embeddings-inference server on both CPU and GPU☆18Updated last year
- ☆30Updated last year
- Example of running LangChain on Cloud Run☆61Updated last year
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆12Updated 10 months ago
- Github repo for storing LlamaDatasets☆33Updated last year
- Search through the Weaviate Podcast!☆57Updated 2 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated last year
- OpenAI functions + Chainlit + Streaming responses. Chain multiple functions in one query.☆27Updated last year
- SLIM Models by LLMWare. A streamlit app showing the capabilities for AI Agents and Function Calls.☆20Updated last year
- ☆44Updated 8 months ago
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines☆31Updated last year
- Streamlit application that helps users analyze RFP's using the latest Gemini 2.0 Flash Experimental LLM.☆12Updated 2 months ago
- Medical Mixture of Experts LLM using Mergekit.☆20Updated 11 months ago
- ☆57Updated last year
- Replit template for hosting LangChain runnables via LangServe☆39Updated last year
- ☆44Updated this week
- Not financial advice.☆28Updated last year
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, …☆26Updated 6 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆37Updated last year
- ☆20Updated last year
- API to load and query documents using RAG☆15Updated last year
- examples and guides to using Nomic Atlas☆27Updated 2 weeks ago
- Answering Questions With HuggingFace And LLM☆16Updated last year