intel / ipex-llmLinks
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
☆8,130Updated this week
Alternatives and similar repositories for ipex-llm
Users that are interested in ipex-llm are comparing it to the libraries listed below
Sorting:
- Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray☆24Updated 5 years ago
- BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray☆2,678Updated last month
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,870Updated 2 years ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆1,910Updated last week
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows…☆2,264Updated last year
- Machine Learning Toolkit for Kubernetes☆15,084Updated last month
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,655Updated 2 weeks ago
- Visualizations for machine learning datasets☆7,375Updated 2 years ago
- Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm☆165Updated 2 months ago
- oneAPI Deep Neural Network Library (oneDNN)☆3,841Updated this week
- Distributed Deep learning with Keras & Spark☆1,571Updated 2 years ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,550Updated 3 weeks ago
- Distributed deep learning on Hadoop and Spark clusters.☆1,259Updated 5 years ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆20,806Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆12,454Updated this week
- Integration of TensorFlow with other open-source frameworks☆1,371Updated 9 months ago
- PipelineAI☆4,171Updated last year
- A flexible, high-performance serving system for machine learning models☆6,305Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆15,709Updated this week
- A distributed approximate nearest neighborhood search (ANN) library which provides a high quality vector index build, search and distribu…☆4,926Updated this week
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆7,228Updated last week
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆27,124Updated this week
- Retrieval and Retrieval-augmented LLMs☆10,191Updated last week
- A library for efficient similarity search and clustering of dense vectors.☆36,192Updated this week
- A low-latency prediction-serving system☆1,416Updated 4 years ago
- ☆1,658Updated 6 years ago
- Large Language Model Text Generation Inference☆10,334Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆52,682Updated this week
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,582Updated this week
- cuDF - GPU DataFrame Library☆9,055Updated this week