intel / ipex-llmLinks
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
☆8,595Updated 2 months ago
Alternatives and similar repositories for ipex-llm
Users that are interested in ipex-llm are comparing it to the libraries listed below
Sorting:
- Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray☆24Updated 5 years ago
- BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray☆2,690Updated last month
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,860Updated 2 years ago
- A flexible, high-performance serving system for machine learning models☆6,343Updated 3 weeks ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,647Updated last month
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆2,001Updated 2 weeks ago
- Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm☆169Updated 8 months ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆27,829Updated this week
- oneAPI Deep Neural Network Library (oneDNN)☆3,953Updated this week
- Machine Learning Toolkit for Kubernetes☆15,387Updated this week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,691Updated this week
- Distributed Deep learning with Keras & Spark☆1,578Updated 2 years ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,704Updated this week
- A low-latency prediction-serving system☆1,421Updated 4 years ago
- Open standard for machine learning interoperability☆20,114Updated this week
- MLeap: Deploy ML Pipelines to Production☆1,530Updated this week
- PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)☆23,564Updated this week
- Distributed deep learning on Hadoop and Spark clusters.☆1,261Updated 6 years ago
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows…☆2,270Updated 2 years ago
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆7,461Updated this week
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,829Updated 2 years ago
- OpenVINO™ is an open source toolkit for optimizing and deploying AI inference☆9,460Updated this week
- An open source ML system for the end-to-end data science lifecycle☆1,077Updated this week
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…☆12,588Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆18,909Updated this week
- A flexible framework of neural networks for deep learning☆5,912Updated 2 years ago
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,189Updated last week
- PredictionIO, a machine learning server for developers and ML engineers.☆12,536Updated 5 years ago
- Benchmarks of approximate nearest neighbor libraries in Python☆5,563Updated 7 months ago
- Integration of TensorFlow with other open-source frameworks☆1,373Updated last year