intel / ipex-llmLinks
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
☆8,377Updated 3 weeks ago
Alternatives and similar repositories for ipex-llm
Users that are interested in ipex-llm are comparing it to the libraries listed below
Sorting:
- Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray☆24Updated 5 years ago
- BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray☆2,687Updated this week
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,869Updated 2 years ago
- Simple and Distributed Machine Learning☆5,170Updated last week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,603Updated 3 weeks ago
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆39,299Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆9,879Updated this week
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆1,973Updated this week
- oneAPI Deep Neural Network Library (oneDNN)☆3,897Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆18,897Updated this week
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,859Updated last month
- Distributed deep learning on Hadoop and Spark clusters.☆1,260Updated 5 years ago
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,166Updated last year
- Resource scheduling and cluster management for AI☆2,676Updated last year
- Large Language Model Text Generation Inference☆10,566Updated 3 weeks ago
- Machine Learning Toolkit for Kubernetes☆15,233Updated 2 months ago
- Integration of TensorFlow with other open-source frameworks☆1,373Updated last year
- An open source ML system for the end-to-end data science lifecycle☆1,063Updated this week
- MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle☆3,682Updated last month
- Ongoing research training transformer models at scale☆13,824Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆18,060Updated this week
- Low-code framework for building custom LLMs, neural networks, and other AI models☆11,597Updated last week
- A flexible, high-performance serving system for machine learning models☆6,326Updated this week
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows…☆2,269Updated 2 years ago
- Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.☆2,745Updated last year
- An all-in-one Docker image for deep learning. Contains all the popular DL frameworks (TensorFlow, Theano, Torch, Caffe, etc.)☆3,863Updated 6 years ago
- High-speed Large Language Model Serving for Local Deployment☆8,363Updated 2 months ago
- Build and run Docker containers leveraging NVIDIA GPUs☆17,428Updated last year
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,146Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,828Updated this week