Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm
☆169Apr 29, 2025Updated 10 months ago
Alternatives and similar repositories for ipex-llm-tutorial
Users that are interested in ipex-llm-tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 本项目借助飞桨平台,构建起一套创新的多模型协同系统,实现 PDF 文件到 Markdown 文件的高效、精准转换。☆27Mar 25, 2025Updated 11 months ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 2 years ago
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆549Mar 16, 2026Updated last week
- 📚 Jupyter notebook tutorials for OpenVINO™☆3,066Updated this week
- This is Microsoft-Phi-3-NvidiaNIMWorkshop☆22Aug 16, 2024Updated last year
- A Gradio Web UI for running local LLM on Intel GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) using IPEX-LLM.☆18Mar 15, 2026Updated last week
- An abstraction library for building domain-specific intelligent agents based on Large Language Models (LLMs). LLMAgent provides a core ar…☆27Feb 5, 2026Updated last month
- Multi-Agent LLM System for Digital Scam Protection☆12Dec 19, 2024Updated last year
- An OpenAI API compatible images server to generate or manipulate images.☆17Feb 2, 2025Updated last year
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- ☆17Dec 16, 2024Updated last year
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated last year
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …☆28Nov 1, 2025Updated 4 months ago
- xeCJK使用范例说明解析☆14Feb 27, 2020Updated 6 years ago
- Playing with io_uring in Zig☆17May 24, 2020Updated 5 years ago
- OpenVINO LLM Benchmark☆11Dec 7, 2023Updated 2 years ago
- Demo on iGPU for FFmpeg decode and scale, OpenVINO inference. this is zero-copy solution, which means No frame data copy from CPU to iGPU…☆17Jan 25, 2023Updated 3 years ago
- This repository contains resources, documentation and artifacts describing LLM agents☆15Jan 22, 2025Updated last year
- In this course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust…☆14Feb 13, 2024Updated 2 years ago
- My solutions to the assignments of CMU 10-714 Deep Learning Systems 2022☆45Mar 22, 2024Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Turn PostgreSQL into your search engine in a Pythonic way.☆51Aug 29, 2025Updated 6 months ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆2,014Mar 13, 2026Updated last week
- ☆20Feb 18, 2025Updated last year
- Mirror of the now discontinued ORCA RISC-V processor from VectorBlox.☆10Feb 11, 2020Updated 6 years ago
- ☆19Feb 11, 2026Updated last month
- Kexplain is an interactive kubectl explain☆12Oct 23, 2023Updated 2 years ago
- ☆17Jan 30, 2024Updated 2 years ago
- Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight☆37May 23, 2023Updated 2 years ago
- Xcbwin - a simple C++ class for graphical outputs using XCB☆12May 12, 2015Updated 10 years ago
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.☆12Aug 15, 2020Updated 5 years ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- With OpenVINO Test Drive, users can run large language models (LLMs) and models trained by Intel Geti on their devices, including AI PCs …☆37Mar 12, 2026Updated last week
- Official implementation "ChipNet: Budget-Aware Pruning with Heaviside Continuous Approximations"☆21Oct 29, 2022Updated 3 years ago
- An open-source tool created by OctoML that converts TVM-optimized models to code runnable in ONNX Runtime.☆17Mar 30, 2023Updated 2 years ago
- Building reliable Retrieval Augmented Generation(RAG) AI Architecture☆13Jul 30, 2024Updated last year
- A simple Diagnostics over IP (DoIP) parser written in C.☆13Aug 16, 2016Updated 9 years ago
- Open-source observability for your LLM application.☆54Jan 2, 2025Updated last year