intel / ipex-llmLinks
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
☆8,477Updated last month
Alternatives and similar repositories for ipex-llm
Users that are interested in ipex-llm are comparing it to the libraries listed below
Sorting:
- Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray☆24Updated 5 years ago
- BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray☆2,686Updated last week
- Simple and Distributed Machine Learning☆5,182Updated this week
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,866Updated 2 years ago
- Open standard for machine learning interoperability☆19,933Updated this week
- Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm☆169Updated 7 months ago
- An open source ML system for the end-to-end data science lifecycle☆1,071Updated last week
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows…☆2,269Updated 2 years ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆1,992Updated this week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,628Updated 3 weeks ago
- Alluxio, data orchestration for analytics and machine learning in the cloud☆7,113Updated 7 months ago
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆7,387Updated this week
- The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, …☆23,073Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆39,982Updated this week
- A low-latency prediction-serving system☆1,419Updated 4 years ago
- Open Machine Learning Compiler Framework☆12,835Updated last week
- oneAPI Deep Neural Network Library (oneDNN)☆3,921Updated last week
- A library for time series analysis on Apache Spark☆1,195Updated 5 years ago
- A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations☆15,989Updated this week
- Distributed Deep learning with Keras & Spark☆1,575Updated 2 years ago
- Development repository for the Triton language and compiler☆17,668Updated this week
- Resource scheduling and cluster management for AI☆2,680Updated last year
- Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.☆19,810Updated last month
- A flexible, high-performance serving system for machine learning models☆6,335Updated last week
- Notes talking about the design and implementation of Apache Spark☆5,347Updated last year
- Interactive and Reactive Data Science using Scala and Spark.☆3,152Updated 2 years ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆27,638Updated last week
- Sparkling Water provides H2O functionality inside Spark cluster☆977Updated 3 weeks ago
- MLeap: Deploy ML Pipelines to Production☆1,529Updated last year
- AI + Data, online. https://vespa.ai☆6,628Updated this week