aahouzi / llama2-chatbot-cpu
A LLaMA2-7b chatbot with memory running on CPU, and optimized using smooth quantization, 4-bit quantization or Intel® Extension For PyTorch with bfloat16.
☆13Updated last year
Alternatives and similar repositories for llama2-chatbot-cpu
Users that are interested in llama2-chatbot-cpu are comparing it to the libraries listed below
Sorting:
- On-device LLM Inference using Mediapipe LLM Inference API.☆21Updated last year
- Chat with Your Data App using Langchain, ChromaDB, Sentence Transformers, and LaMiNi LM Model. This Chatbot is completely powered by Open…☆17Updated last year
- Multimodal AI App using Llava 7B and Gradio.☆38Updated last year
- Automatic Generation of Visualizations and Infographics with LLMs and Streamlit for your CSV data.☆32Updated last year
- Medical Help App using GPT-4V☆25Updated last year
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆13Updated last year
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- Fine Tuning Model for different NLP task☆14Updated 2 years ago
- Agentic RAG using Crew AI.☆27Updated 10 months ago
- This is a RAG implementation using Open Source stack. BioMistral 7B has been used to build this app along with PubMedBert as an embedding…☆72Updated last month
- PubMed Healthcare Chatbot. LLM Augmented Q&A over PubMed Search Engine.☆22Updated last year
- ☆63Updated last year
- Playing with RAG using Ollama, Langchain, and Streamlit. This project aims to demonstrate how a recruiter or HR personnel can benefit fro…☆15Updated last year
- Metadata Enrichment using KeyBERT for advanced and improved RAG.☆10Updated last year
- YouTube Video Summarization App built using open source LLM and Framework like Llama 2, Haystack, Whisper, and Streamlit. This app smooth…☆56Updated last year
- CrewAI AgentOps: Monitor your AI Agents☆17Updated 10 months ago
- Investment Banker Chatbot using Intel's Neural Chat 7B LLM, BGE Embeddings, ChromaDB, Langchain, and CTransformers.☆17Updated last year
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆17Updated 5 months ago
- This is streamlit app that gives you the perplexity and burstiness scores for LLM generated responses. It helps you detect some sort of p…☆16Updated last year
- A multi-agent business consultant app on streamlit implemented using crewAI☆17Updated 10 months ago
- ☆34Updated last year
- Groq-Whisper Fast Transcription App built using Groq API and Streamlit.☆23Updated 7 months ago
- Question Answer Generation App using Mistral 7B, Langchain, and FastAPI.☆65Updated last year
- This is a RAG implementation using Open Source stack. BioMistral 7B has been used to build this app along with PubMedBert as an embedding…☆17Updated 9 months ago
- SLIM Models by LLMWare. A streamlit app showing the capabilities for AI Agents and Function Calls.☆20Updated last year
- llmware RAG Demo App.☆17Updated last year
- Scraping Wikipedia by combining LangChain's agents and tools with OpenAI's LLMs and function calling☆27Updated last year
- A simple Sentiment Analysis API in FastAPI.☆13Updated 4 months ago
- Chainlit app for advanced RAG. Uses llamaparse, langchain, qdrant and models from groq.☆45Updated 11 months ago
- Answering Questions With HuggingFace And LLM☆16Updated last year