intel-staging / Langchain-ChatchatLinks
Knowledge Base QA using RAG pipeline on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) with IPEX-LLM
β17Updated 8 months ago
Alternatives and similar repositories for Langchain-Chatchat
Users that are interested in Langchain-Chatchat are comparing it to the libraries listed below
Sorting:
- the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidlyβ32Updated last year
- π©π€π€ A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)β24Updated 2 years ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.β49Updated 2 months ago
- Port of Facebook's LLaMA model in C/C++β22Updated 2 years ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Siliconβ16Updated 8 months ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cppβ167Updated 8 months ago
- KAN (KolmogorovβArnold Networks) in the MLX framework for Apple Siliconβ31Updated 6 months ago
- π₯ Health monitor for a Petals swarmβ40Updated last year
- Inference of Large Multimodal Models in C/C++. LLaVA and othersβ48Updated 2 years ago
- Implementation of nougat that focuses on processing pdf locally.β83Updated 11 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMsβ11Updated 2 years ago
- Let GPT-4 run your Minecraft server!β10Updated 2 years ago
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.β90Updated 2 years ago
- Light WebUI for lm.rsβ24Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hoursβ66Updated last year
- A python command-line tool to download & manage MLX AI models from Hugging Face.β19Updated last year
- An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, iβ¦β19Updated 11 months ago
- Simple Implementation of a Transformer in the new framework MLX by Appleβ19Updated last year
- β14Updated 2 years ago
- Minimal, clean code implementation of RAG with mlx using gguf model weightsβ53Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.β43Updated 6 months ago
- Tools for formatting large language model prompts.β13Updated 2 years ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUsβ92Updated last year
- Tools for the LLaMA language modelβ12Updated 2 years ago
- A library for incremental loading of large PyTorch checkpointsβ56Updated 2 years ago
- mlx image models for Apple Silicon machinesβ90Updated last month
- LLM training in simple, raw C/Metal Shading Languageβ60Updated last year
- Explore our open source AI portfolio! Develop, train, and deploy your AI solutions with performance- and productivity-optimized tools froβ¦β62Updated 9 months ago
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance.β17Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradioβ38Updated 2 years ago