Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm
☆171May 4, 2026Updated last month
Alternatives and similar repositories for ipex-llm-tutorial
Users that are interested in ipex-llm-tutorial are comparing it to the libraries listed below.
- Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,…☆8,851Jan 28, 2026Updated 5 months ago
- Step-by-step Deep Learning Tutorials on Apache Spark using BigDL☆210Jan 3, 2023Updated 3 years ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 3 years ago
- VascuSynth: Vascular Tree Synthesis Software☆11Nov 23, 2022Updated 3 years ago
- A Gradio Web UI for running local LLM on Intel GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) using IPEX-LLM.☆17Jun 21, 2026Updated last week
- An abstraction library for building domain-specific intelligent agents based on Large Language Models (LLMs). LLMAgent provides a core ar…☆27Feb 5, 2026Updated 4 months ago
- Xiaohongshu multi-account management☆13Jul 24, 2025Updated 11 months ago
- [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference☆29Jan 27, 2026Updated 5 months ago
- Multi-Agent LLM System for Digital Scam Protection☆15Dec 19, 2024Updated last year
- Another reverse proxy that provides authentication with OpenID Connect☆10Jul 10, 2023Updated 2 years ago
- PyTorch code for our paper "Progressive Binarization with Semi-Structured Pruning for LLMs"☆13Mar 11, 2026Updated 3 months ago
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 3 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).☆10Dec 8, 2022Updated 3 years ago
- [🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound …☆33Mar 30, 2026Updated 3 months ago
- Usage examples and explanations for xeCJK☆14Feb 27, 2020Updated 6 years ago
- Playing with io_uring in Zig☆17May 14, 2026Updated last month
- OpenVINO LLM Benchmark☆11Dec 7, 2023Updated 2 years ago
- This course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust…☆15Feb 13, 2024Updated 2 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 4 months ago
- ☆14Apr 22, 2024Updated 2 years ago
- official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".☆68Dec 25, 2025Updated 6 months ago
- Turn PostgreSQL into your search engine in a Pythonic way.☆52Aug 29, 2025Updated 10 months ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆2,015Mar 30, 2026Updated 3 months ago
- GCN implementation on top of Apache Spark☆16Oct 30, 2022Updated 3 years ago
- Firmware - CAN enabled small input / output expansion board. Switch panels, steering wheel buttons/knobs, etc.☆15May 13, 2026Updated last month
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆47Dec 12, 2024Updated last year
- A serverless server with wasmer and WebAssembly☆13Sep 23, 2020Updated 5 years ago
- ☆17Jan 30, 2024Updated 2 years ago
- ☆10May 17, 2019Updated 7 years ago
- [ICCV 2025] QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation☆18Sep 26, 2025Updated 9 months ago
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.☆12Aug 15, 2020Updated 5 years ago
- An open-source tool created by OctoML that converts TVM-optimized models to code runnable in ONNX Runtime.☆17Mar 30, 2023Updated 3 years ago
- Synthetic data for fine tuning LLM☆27Dec 26, 2024Updated last year
- CVPR 2024 Research Paper with Code☆47Jun 28, 2024Updated 2 years ago
- Extension based on VSCode editor☆15Apr 7, 2023Updated 3 years ago
- Limit Orderbook Replay/Analysis Library☆10Nov 19, 2018Updated 7 years ago
- [MIA'22] Automatic Grading Assessments for Knee MRI Cartilage Defects via Self-ensembling Semi-supervised Learning with Dual-Consistency☆23Jul 27, 2022Updated 3 years ago
- An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications.☆14Feb 3, 2025Updated last year