intel-analytics / ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, GraphRAG, DeepSpeed, Axolotl, etc
☆6,683Updated this week
Related projects ⓘ
Alternatives and complementary repositories for ipex-llm
- BigDL: Distributed TensorFlow, Keras and PyTorch on Apache Spark/Flink & Ray☆2,665Updated last week
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,876Updated last year
- PipelineAI☆4,168Updated 6 months ago
- Unsupervised text tokenizer for Neural Network-based text generation.☆10,252Updated last week
- Sparkling Water provides H2O functionality inside Spark cluster☆967Updated this week
- An open source ML system for the end-to-end data science lifecycle☆1,035Updated last week
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆6,915Updated this week
- A system for quickly generating training data with weak supervision☆5,807Updated 6 months ago
- A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used …☆16,671Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆8,296Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆33,803Updated this week
- Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.☆14,247Updated 2 months ago
- Distributed Deep learning with Keras & Spark☆1,574Updated last year
- Integration of TensorFlow with other open-source frameworks☆1,374Updated last month
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk☆13,233Updated 3 months ago
- Microsoft Distributed Machine Learning Toolkit☆2,745Updated 6 years ago
- Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.☆15,514Updated last year
- MLeap: Deploy ML Pipelines to Production☆1,503Updated 4 months ago
- An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models☆4,377Updated this week
- A library for efficient similarity search and clustering of dense vectors.☆31,320Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆11,761Updated this week
- The Open Source Feature Store for Machine Learning☆5,592Updated this week
- Distributed deep learning on Hadoop and Spark clusters.☆1,266Updated 4 years ago
- Development repository for the Triton language and compiler☆13,311Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆29,785Updated this week
- Visualizations for machine learning datasets☆7,355Updated last year
- Open source platform for the machine learning lifecycle☆18,704Updated this week