fbaldassarri / llama-cpp-containerLinks
Docker image to deploy a llama-cpp container with conda-ready environments
☆17Updated 2 years ago
Alternatives and similar repositories for llama-cpp-container
Users that are interested in llama-cpp-container are comparing it to the libraries listed below
Sorting:
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search☆40Updated 2 years ago
- ☆101Updated 2 years ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆130Updated 2 years ago
- Document Q&A on Wikipedia articles using LLMs☆80Updated 2 years ago
- ☆46Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated 2 years ago
- Calling LLM APIs on a Raspberry Pi for lulz☆24Updated 2 years ago
- ☆25Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆146Updated 2 years ago
- Open Source Embeddings Optimisation and Eval Framework for RAG/LLM Applications. Documentations at https://docs.vectorboard.ai/introducti…☆50Updated 2 years ago
- TheBloke's Dockerfiles☆308Updated last year
- ☆185Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆109Updated 2 years ago
- ☆141Updated 2 years ago
- PanML is a high level generative AI/ML development and analysis library designed for ease of use and fast experimentation.☆117Updated 2 years ago
- ☆138Updated 3 years ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Updated 2 years ago
- Explore Multiple Vector Databases and chat with documents on Multiple LLM models, private LLM models☆48Updated 2 years ago
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆60Updated last year
- An OpenAI-like LLaMA inference API☆113Updated 2 years ago
- A minimalist Docker project to help people getting started with Node, WizardCoder, CTransformers, Python, Express and TypeScript. Ready t…☆14Updated 2 years ago
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI☆114Updated 2 years ago
- Tools for building products and apps with LLMs.☆75Updated last year
- Machine learning tool-set for Paperspace VMs☆60Updated 2 years ago
- Locally running LLM with internet access☆97Updated 7 months ago
- HuggingChat like UI in Gradio☆70Updated 2 years ago
- Experiments with open source LLMs☆74Updated last month
- FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)☆246Updated 2 years ago
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆38Updated 2 years ago
- Weaviate Podcast MCP☆59Updated last month