intel / ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
☆7,220Updated this week
Alternatives and similar repositories for ipex-llm:
Users that are interested in ipex-llm are comparing it to the libraries listed below
- BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray☆2,673Updated last month
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,875Updated last year
- oneAPI Deep Neural Network Library (oneDNN)☆3,726Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆35,530Updated this week
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows…☆2,252Updated last year
- Open standard for machine learning interoperability☆18,467Updated this week
- A library for efficient similarity search and clustering of dense vectors.☆33,077Updated this week
- A flexible, high-performance serving system for machine learning models☆6,230Updated this week
- Distributed deep learning on Hadoop and Spark clusters.☆1,263Updated 5 years ago
- Microsoft Distributed Machine Learning Toolkit☆2,749Updated 6 years ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,389Updated 2 weeks ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆12,028Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆15,648Updated this week
- Simple and Distributed Machine Learning☆5,097Updated 2 weeks ago
- Benchmarks of approximate nearest neighbor libraries in Python☆5,107Updated 3 weeks ago
- Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit☆17,554Updated last year
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,788Updated last year
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk☆13,504Updated 6 months ago
- Distributed Deep learning with Keras & Spark☆1,572Updated last year
- Intel® Nervana™ reference deep learning framework committed to best performance on all hardware☆3,870Updated 4 years ago
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆8,769Updated this week
- Parallel computing with task scheduling☆12,950Updated this week
- DyNet: The Dynamic Neural Network Toolkit☆3,428Updated last year
- OpenVINO™ is an open source toolkit for optimizing and deploying AI inference☆7,838Updated this week
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆7,040Updated this week
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,781Updated 3 years ago
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used …☆16,963Updated this week
- Theano was a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays…☆9,927Updated last year
- Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm☆158Updated 6 months ago
- http://torch.ch☆9,019Updated 2 years ago