intel / ipex-llmLinks
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
☆8,096Updated last week
Alternatives and similar repositories for ipex-llm
Users that are interested in ipex-llm are comparing it to the libraries listed below
Sorting:
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,870Updated 2 years ago
- Simple and Distributed Machine Learning☆5,147Updated this week
- PipelineAI☆4,171Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆12,435Updated this week
- PredictionIO, a machine learning server for developers and ML engineers.☆12,528Updated 4 years ago
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆7,222Updated this week
- Mirror of Apache Mahout☆2,171Updated last week
- Open source platform for the machine learning lifecycle☆21,203Updated this week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,651Updated last week
- Distributed Deep learning with Keras & Spark☆1,572Updated 2 years ago
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows…☆2,264Updated last year
- Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.☆19,392Updated last month
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆27,082Updated last week
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,167Updated 9 months ago
- Integration of TensorFlow with other open-source frameworks☆1,371Updated 9 months ago
- Tensor library for machine learning☆12,808Updated this week
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆6,973Updated 5 months ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,577Updated this week
- Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.☆6,514Updated last week
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used …☆17,392Updated this week
- cuDF - GPU DataFrame Library☆9,033Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆15,932Updated this week
- PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)☆23,015Updated this week
- Automated Machine Learning with scikit-learn☆7,881Updated 2 weeks ago
- A flexible, high-performance serving system for machine learning models☆6,297Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆51,794Updated this week
- A low-latency prediction-serving system☆1,416Updated 4 years ago
- Fast and Accurate ML in 3 Lines of Code☆9,126Updated this week
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆4,891Updated 3 months ago
- Development repository for the Triton language and compiler☆16,114Updated this week