intel-analytics / ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, GraphRAG, DeepSpeed, Axolotl, etc
☆6,927Updated this week
Alternatives and similar repositories for ipex-llm:
Users that are interested in ipex-llm are comparing it to the libraries listed below
- Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray☆23Updated 4 years ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,872Updated last year
- BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray☆2,671Updated 2 months ago
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,337Updated last month
- Unsupervised text tokenizer for Neural Network-based text generation.☆10,479Updated last month
- oneAPI Deep Neural Network Library (oneDNN)☆3,677Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆11,930Updated last month
- cuDF - GPU DataFrame Library☆8,597Updated this week
- Distributed Deep learning with Keras & Spark☆1,572Updated last year
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆6,991Updated this week
- Open source platform for the machine learning lifecycle☆19,253Updated this week
- An open source ML system for the end-to-end data science lifecycle☆1,039Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆14,846Updated this week
- Integration of TensorFlow with other open-source frameworks☆1,373Updated 3 months ago
- An open-source NLP research library, built on PyTorch.☆11,782Updated 2 years ago
- PipelineAI☆4,171Updated 9 months ago
- TensorFlow's Visualization Toolkit☆6,774Updated 3 weeks ago
- Library for fast text representation and classification.☆26,017Updated 9 months ago
- A flexible, high-performance serving system for machine learning models☆6,212Updated 2 weeks ago
- Parallel computing with task scheduling☆12,851Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆7,353Updated this week
- DyNet: The Dynamic Neural Network Toolkit☆3,426Updated last year
- A natural language modeling framework based on PyTorch☆6,334Updated 2 years ago
- Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm☆152Updated 5 months ago
- TensorFlow code and pre-trained models for BERT☆38,519Updated 5 months ago
- Alluxio, data orchestration for analytics and machine learning in the cloud☆6,902Updated last month
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.☆9,806Updated 5 months ago