intel-staging / Langchain-Chatchat
Knowledge base QA using a RAG pipeline on Intel CPUs and GPUs (e.g., a local PC with an iGPU, or a discrete GPU such as Arc, Flex, or Max) with IPEX-LLM
☆16 Updated 3 weeks ago
Alternatives and similar repositories for Langchain-Chatchat
Users who are interested in Langchain-Chatchat are comparing it to the libraries listed below.
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon ☆16 Updated last week
- ☆14 Updated 3 months ago
- Holodeck is a project to create test environments optimised for GPU projects. ☆13 Updated last week
- Developer kits reference setup scripts for various kinds of Intel platforms and GPUs ☆24 Updated this week
- Automatic Test Generator ☆12 Updated last month
- A simple app to use OpenAI API to generate music ☆9 Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆10 Updated last year
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho… ☆109 Updated 2 months ago
- Estimating hardware and cloud costs of LLMs and transformer projects ☆15 Updated last year
- A Python command-line tool to download & manage MLX AI models from Hugging Face. ☆17 Updated 8 months ago
- AML's goal is to make benchmarking of various AI architectures on Ampere CPUs a pleasurable experience :) ☆21 Updated last week
- ☆14 Updated 2 months ago
- Serving CrewAI Agent as REST API with BentoML, optionally with self-hosted open-source LLMs ☆17 Updated 4 months ago
- AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse A… ☆19 Updated last month
- Horizon chart for CPU/GPU/Neural Engine utilization monitoring on Apple M1/M2 and NVIDIA GPUs on Linux ☆25 Updated 3 weeks ago
- ☆15 Updated last year
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon ☆17 Updated last week
- CI for ggml and related projects ☆29 Updated this week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆86 Updated this week
- GGML implementation of BERT model with Python bindings and quantization. ☆26 Updated last year
- ☆14 Updated 11 months ago
- 🏥 Health monitor for a Petals swarm ☆37 Updated 9 months ago
- A version of BabyAGI using DSPy and typed predictors ☆17 Updated last year
- The Swarm Ecosystem ☆20 Updated 9 months ago
- A super simple web interface to perform blind tests on LLM outputs. ☆28 Updated last year
- ☆35 Updated this week
- GraphRag vs Embeddings ☆13 Updated 10 months ago
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow … ☆45 Updated this week
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆26 Updated 10 months ago
- Galleries for Models, Datasets, and Plugins used by Transformer Lab ☆21 Updated this week