intel / ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
☆8,536 Updated 2 months ago
Alternatives and similar repositories for ipex-llm
Users interested in ipex-llm are comparing it to the libraries listed below.
- Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray ☆24 Updated 5 years ago
- BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray ☆2,690 Updated last month
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters. ☆3,864 Updated 2 years ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform ☆1,997 Updated last week
- Tensor library for machine learning ☆13,714 Updated this week
- Distributed deep learning on Hadoop and Spark clusters. ☆1,261 Updated 6 years ago
- Open Machine Learning Compiler Framework ☆12,908 Updated last week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle ☆3,687 Updated last week
- PredictionIO, a machine learning server for developers and ML engineers. ☆12,534 Updated 4 years ago
- Distribute and run LLMs with a single file. ☆23,525 Updated this week
- Alluxio, data orchestration for analytics and machine learning in the cloud ☆7,128 Updated 7 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs. ☆7,377 Updated last week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet. ☆14,641 Updated 2 weeks ago
- Python bindings for llama.cpp ☆9,821 Updated 4 months ago
- Distributed Deep learning with Keras & Spark ☆1,577 Updated 2 years ago
- ☆1,655 Updated 7 years ago
- oneAPI Deep Neural Network Library (oneDNN) ☆3,935 Updated last week
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F… ☆7,443 Updated this week
- MLeap: Deploy ML Pipelines to Production ☆1,527 Updated last year
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)… ☆14,220 Updated last week
- Simple and Distributed Machine Learning ☆5,191 Updated this week
- A flexible, high-performance serving system for machine learning models ☆6,336 Updated last month
- cuDF - GPU DataFrame Library ☆9,375 Updated last week
- An open source ML system for the end-to-end data science lifecycle ☆1,075 Updated last week
- Resource scheduling and cluster management for AI ☆2,684 Updated last year
- The Triton Inference Server provides an optimized cloud and edge inferencing solution. ☆10,131 Updated this week
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming. ☆10,031 Updated 3 months ago
- High-speed Large Language Model Serving for Local Deployment ☆8,460 Updated 4 months ago
- Sparkling Water provides H2O functionality inside Spark cluster ☆977 Updated last month
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning ☆1,784 Updated 4 years ago