intel / ipex-llmLinks
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
☆8,283Updated 2 weeks ago
Alternatives and similar repositories for ipex-llm
Users that are interested in ipex-llm are comparing it to the libraries listed below
Sorting:
- Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray☆24Updated 5 years ago
- BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray☆2,685Updated 2 weeks ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,868Updated 2 years ago
- Simple and Distributed Machine Learning☆5,162Updated 3 weeks ago
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆7,276Updated this week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,581Updated last month
- A flexible, high-performance serving system for machine learning models☆6,319Updated this week
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used …☆17,547Updated this week
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,824Updated last year
- A system for quickly generating training data with weak supervision☆5,911Updated last year
- An open source ML system for the end-to-end data science lifecycle☆1,062Updated this week
- Breeze is/was a numerical processing library for Scala.☆3,456Updated last year
- High-speed Large Language Model Serving for Local Deployment☆8,319Updated last month
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆1,945Updated last week
- MLeap: Deploy ML Pipelines to Production☆1,521Updated 9 months ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,785Updated 4 years ago
- The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end …☆21,919Updated this week
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆27,296Updated this week
- Distributed deep learning on Hadoop and Spark clusters.☆1,259Updated 5 years ago
- OpenVINO™ is an open source toolkit for optimizing and deploying AI inference☆8,786Updated this week
- Open standard for machine learning interoperability☆19,554Updated this week
- Sparkling Water provides H2O functionality inside Spark cluster☆975Updated 2 weeks ago
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆15,896Updated this week
- Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.☆19,582Updated this week
- REST job server for Apache Spark☆2,844Updated last month
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆33,316Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆12,571Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆38,769Updated this week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,099Updated 2 weeks ago
- An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model c…☆14,265Updated last year