intel-staging / Langchain-Chatchat
Knowledge Base QA using RAG pipeline on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) with IPEX-LLM
☆16Updated this week
Alternatives and similar repositories for Langchain-Chatchat:
Users that are interested in Langchain-Chatchat are comparing it to the libraries listed below
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow …☆35Updated this week
- Estimating hardware and cloud costs of LLMs and transformer projects☆11Updated last year
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆22Updated last year
- ☆41Updated this week
- GPT2 implementation in C++ using Ort☆26Updated 4 years ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆88Updated this week
- Port of Facebook's LLaMA model in C/C++☆20Updated last year
- ☆12Updated this week
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 5 months ago
- A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support☆15Updated 4 years ago
- Automatic Test Generator☆11Updated last year
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆106Updated last month
- ☆52Updated 9 months ago
- Source of the website of the BigCode project.☆19Updated this week
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆38Updated last month
- ☆25Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆27Updated 11 months ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆58Updated last month
- Tools for the LLaMA language model☆12Updated last year
- Fork of llama.cpp, extended for GPT-NeoX, RWKV-v4, and Falcon models☆30Updated last year
- AMD related optimizations for transformer models☆64Updated 2 months ago
- A super simple web interface to perform blind tests on LLM outputs.☆27Updated 10 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆14Updated 2 months ago
- the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly☆28Updated 3 months ago
- Intel® SHMEM - Device initiated shared memory based communication library☆22Updated 2 months ago
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama models☆36Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆17Updated last year
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆68Updated 6 months ago
- GraphRag vs Embeddings☆13Updated 6 months ago
- LLM-powered autonomous agent with hierarchical task management☆47Updated last year