intel / ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
☆8,426 · Updated 3 weeks ago
Alternatives and similar repositories for ipex-llm
Users interested in ipex-llm are comparing it to the libraries listed below.
- Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray ☆24 · Updated 5 years ago
- BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray ☆2,687 · Updated 3 weeks ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters. ☆3,870 · Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆61,727 · Updated this week
- SGLang is a fast serving framework for large language models and vision language models. ☆19,462 · Updated last week
- A Python package extending official PyTorch to easily obtain performance on Intel platforms ☆1,986 · Updated this week
- Open deep learning compiler stack for CPU, GPU and specialized accelerators ☆12,776 · Updated this week
- High-speed Large Language Model Serving for Local Deployment ☆8,374 · Updated 3 months ago
- Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm ☆169 · Updated 6 months ago
- oneAPI Deep Neural Network Library (oneDNN) ☆3,905 · Updated this week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,620 · Updated this week
- Development repository for the Triton language and compiler ☆17,392 · Updated last week
- Fast and memory-efficient exact attention ☆20,280 · Updated this week
- Simple and Distributed Machine Learning ☆5,173 · Updated this week
- Machine Learning Toolkit for Kubernetes ☆15,257 · Updated 2 months ago
- Go ahead and axolotl questions ☆10,716 · Updated this week
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl… ☆2,165 · Updated last year
- Tensor library for machine learning ☆13,361 · Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution. ☆9,972 · Updated this week
- Transformer-related optimization, including BERT, GPT ☆6,338 · Updated last year
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat… ☆12,008 · Updated this week
- An open source ML system for the end-to-end data science lifecycle ☆1,067 · Updated this week
- OpenVINO™ is an open source toolkit for optimizing and deploying AI inference ☆9,119 · Updated last week
- Breeze is/was a numerical processing library for Scala. ☆3,456 · Updated last month
- Accessible large language models via k-bit quantization for PyTorch. ☆7,716 · Updated this week
- SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R… ☆2,521 · Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI libraries for accelerating ML workloads. ☆39,664 · Updated this week
- Build and run Docker containers leveraging NVIDIA GPUs ☆17,431 · Updated last year
- Interactive and Reactive Data Science using Scala and Spark. ☆3,153 · Updated 2 years ago
- Large Language Model Text Generation Inference ☆10,605 · Updated last month
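Several of the libraries above (ipex-llm, bitsandbytes, neural-compressor) center on low-bit weight quantization. As a rough illustration of the core idea only — this is a toy sketch, not code from any of these projects — a symmetric per-tensor INT4 quantizer can be written in a few lines:

```python
# Illustrative sketch only: symmetric per-tensor INT4 quantization,
# the basic idea behind the low-bit (INT4/FP4) optimizations listed
# above. Real libraries add per-group scales, NF4, calibration, and
# fused kernels on top of this.

def quantize_int4(weights):
    """Map floats to integers in [-8, 7] using one shared scale."""
    scale = max(abs(w) for w in weights) / 7.0
    if scale == 0.0:          # all-zero tensor: any scale works
        scale = 1.0
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    return q, scale

def dequantize_int4(q, scale):
    """Recover approximate float values from INT4 codes."""
    return [v * scale for v in q]

weights = [0.12, -0.53, 0.98, -1.4, 0.07]
q, scale = quantize_int4(weights)
restored = dequantize_int4(q, scale)
# Each restored weight is within one quantization step of the original.
assert all(abs(a - b) < scale for a, b in zip(weights, restored))
```

Storing 4 bits per weight instead of 32 cuts memory roughly 8x at the cost of rounding error, which is why the serving engines in this list pair low-bit storage with careful per-group scaling and calibration.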