intel-staging / Langchain-Chatchat
Knowledge Base QA using RAG pipeline on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) with IPEX-LLM
☆17 Updated 9 months ago
Alternatives and similar repositories for Langchain-Chatchat
Users that are interested in Langchain-Chatchat are comparing it to the libraries listed below
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon ☆31 Updated 7 months ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon ☆16 Updated 8 months ago
- Port of Facebook's LLaMA model in C/C++ ☆21 Updated 2 years ago
- Simple LLM inference server ☆20 Updated last year
- Minimal, clean code implementation of RAG with mlx using gguf model weights ☆53 Updated last year
- Locally running LLM with internet access ☆97 Updated 7 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog). ☆46 Updated last year
- the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly ☆32 Updated last year
- Explore our open source AI portfolio! Develop, train, and deploy your AI solutions with performance- and productivity-optimized tools fro… ☆66 Updated 10 months ago
- powerful and fast tool calling agents ☆80 Updated 10 months ago
- Shared personal notes created while working with the Apple MLX machine learning framework ☆24 Updated last month
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆52 Updated last year
- A python command-line tool to download & manage MLX AI models from Hugging Face. ☆19 Updated last year
- ☆46 Updated 2 years ago
- For individual users, watsonx Code Assistant can access a local IBM Granite model ☆37 Updated 7 months ago
- AI system powered by large language models. ☆33 Updated this week
- Examples of calling OpenRouter models from Python code ☆88 Updated 9 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs ☆93 Updated 2 weeks ago
- ☆15 Updated 2 years ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆65 Updated last year
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho… ☆115 Updated 6 months ago
- ☆40 Updated last year
- ☆15 Updated 2 years ago
- Code generation with LLMs 🔗 ☆53 Updated 2 years ago
- Mixture-of-Ollamas ☆30 Updated last year
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance. ☆18 Updated last year
- EmbeddedLLM: API server for Embedded Device Deployment. Currently support CUDA/OpenVINO/IpexLLM/DirectML/CPU ☆50 Updated last year
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline ☆41 Updated last year
- A vllm proxy server to add security and multi model management for vllm servers ☆12 Updated last year
- 1.58 Bit LLM on Apple Silicon using MLX ☆242 Updated last year