generativelabs / exllama-runpod-serverlessLinks
☆17Updated 2 years ago
Alternatives and similar repositories for exllama-runpod-serverless
Users that are interested in exllama-runpod-serverless are comparing it to the libraries listed below
Sorting:
- ☆50Updated 2 years ago
- ☆53Updated 2 years ago
- ☆36Updated last year
- Local character AI chatbot with chroma vector store memory and some scripts to process documents for Chroma☆33Updated last year
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆73Updated 2 years ago
- ☆45Updated 2 years ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆37Updated 2 years ago
- Example of calling OpenRouter from a Streamit app☆102Updated 2 years ago
- Agent with vision ability via llava & autogen☆73Updated 2 years ago
- LLM Siri with OpenAI, Perplexity, Ollama, Llama2, Mistral, Mixtral & Langchain☆60Updated last year
- The FunctionChain is a tool that simplifies and organizes the process of invoking OpenAI functions in your Node.js applications. With thi…☆54Updated 2 years ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆103Updated last year
- Conduct consumer interviews with synthetic focus groups using LLMs and LangChain☆43Updated 2 years ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆119Updated last year
- ☆61Updated 2 years ago
- An awesome & curated list of best LLMOps tools for developers☆24Updated 2 years ago
- ☆46Updated last year
- 🌸 The open framework for question answering fine-tuning LLMs on private data☆69Updated last year
- ☆47Updated last year
- Little AI roleplay program☆59Updated 2 years ago
- Access your Ollama inference server running on your computer from anywhere. Set up with NextJS + Langchain JS LCEL + Ngrok☆26Updated last year
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- oobaboga -text-generation-webui implementation of wafflecomposite - langchain-ask-pdf-local☆70Updated 2 years ago
- The very first artist assistant☆22Updated 2 years ago
- Example code for extracting Q&A datasets from LLM's☆82Updated 2 years ago
- Ollama models of NousResearch/Hermes-2-Pro-Mistral-7B-GGUF☆31Updated last year
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search☆39Updated 2 years ago
- ☆89Updated last year
- An OpenAI-like LLaMA inference API☆113Updated 2 years ago
- ☆11Updated last year