aahouzi / llama2-chatbot-cpu
A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTorch with bfloat16.
☆13Updated 11 months ago
Alternatives and similar repositories for llama2-chatbot-cpu:
Users that are interested in llama2-chatbot-cpu are comparing it to the libraries listed below
- Medical Mixture of Experts LLM using Mergekit.☆20Updated 10 months ago
- LangChain Baby AGI integrated as a Web App using Databutton☆16Updated last year
- Question Answer Generation App using Mistral 7B, Langchain, and FastAPI.☆63Updated last year
- GGUF Quantization of any LLM.☆35Updated 10 months ago
- Chat with Your Data App using Langchain, ChromaDB, Sentence Transformers, and LaMiNi LM Model. This Chatbot is completely powered by Open…☆17Updated last year
- Multimodal AI App using Llava 7B and Gradio.☆38Updated 9 months ago
- A Gradio app for chatting with PDFs☆50Updated 5 months ago
- RAG Tool using Haystack, Mistral, and Chainlit. All open source stack on CPU.☆23Updated last year
- This is a RAG implementation using Open Source stack. BioMistral 7B has been used to build this app along with PubMedBert as an embedding…☆64Updated 11 months ago
- Medical Help App using GPT-4V☆24Updated last year
- Building Private Healthcare AI Assistant for Clinics Using Qdrant Hybrid Cloud, DSPy and Groq - Llama3☆20Updated 8 months ago
- ☆32Updated last year
- CrewAI AgentOps: Monitor your AI Agents☆15Updated 7 months ago
- ☆59Updated last year
- A library for evaluating Retrieval-Augmented Generation (RAG) systems (The traditional ways).☆30Updated 5 months ago
- OpenAI API _ Chatbot implementation☆28Updated 10 months ago
- Using GPT-3 and Carrot (GPT-3 for computer vision) to create detailed descriptions of images.☆13Updated 2 years ago
- ☆19Updated last year
- On-device LLM Inference using Mediapipe LLM Inference API.☆21Updated 10 months ago
- Simple Chainlit app to have interaction with your documents.☆51Updated 11 months ago
- ChatCSV bot using Llama 2, Sentence Transformers, CTransformers, Langchain, and Streamlit.☆65Updated 8 months ago
- Answering Questions With HuggingFace And LLM☆16Updated last year
- ☆20Updated 7 months ago
- ☆10Updated 7 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆19Updated 4 months ago
- Retrieval Augmented Generation (RAG) on audio data with LangChain☆12Updated last year
- ☆15Updated 10 months ago
- Medical RAG QA App using Meditron 7B LLM, Qdrant Vector Database, and PubMedBERT Embedding Model.☆50Updated last year
- SLIM Models by LLMWare. A streamlit app showing the capabilities for AI Agents and Function Calls.☆20Updated 11 months ago
- Agentic RAG using Crew AI.☆24Updated 7 months ago