Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm
☆171Apr 29, 2025Updated 11 months ago
Alternatives and similar repositories for ipex-llm-tutorial
Users that are interested in ipex-llm-tutorial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 本项目借助飞桨平台,构建起一套创新的多模型协同系统,实现 PDF 文件到 Markdown 文件的高效、精准转换。☆27Mar 25, 2025Updated last year
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 2 years ago
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆561Apr 2, 2026Updated last week
- ☆13Oct 28, 2020Updated 5 years ago
- 📚 Jupyter notebook tutorials for OpenVINO™☆3,096Updated this week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Another ChatGLM2 implementation for GPTQ quantization☆55Oct 15, 2023Updated 2 years ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).☆10Dec 8, 2022Updated 3 years ago
- OpenVINO LLM Benchmark☆11Dec 7, 2023Updated 2 years ago
- Demo on iGPU for FFmpeg decode and scale, OpenVINO inference. this is zero-copy solution, which means No frame data copy from CPU to iGPU…☆17Jan 25, 2023Updated 3 years ago
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated last year
- pytorch code examples for measuring the performance of collective communication calls in AI workloads☆19Sep 18, 2025Updated 6 months ago
- In this course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust…☆14Feb 13, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".☆55Dec 25, 2025Updated 3 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Turn PostgreSQL into your search engine in a Pythonic way.☆52Aug 29, 2025Updated 7 months ago
- Intel® Tensor Processing Primitives extension for Pytorch*☆18Updated this week
- Mirror of the now discontinued ORCA RISC-V processor from VectorBlox.☆10Feb 11, 2020Updated 6 years ago
- 北邮统一登录网关 Session。用于需要登录的网络请求。☆14Sep 17, 2022Updated 3 years ago
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆48Dec 12, 2024Updated last year
- [ICCV 2025] QuantCache:Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation☆16Sep 26, 2025Updated 6 months ago
- Xcbwin - a simple C++ class for graphical outputs using XCB☆12May 12, 2015Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- With OpenVINO Test Drive, users can run large language models (LLMs) and models trained by Intel Geti on their devices, including AI PCs …☆37Mar 12, 2026Updated last month
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- Tools for easier OpenVINO development/debugging☆10Jul 16, 2025Updated 8 months ago
- Building Llama 3 from scratch using PyTorch☆13Sep 1, 2024Updated last year
- My Interview recording repo.☆11Mar 22, 2023Updated 3 years ago
- CVPR 2024 Research Paper with Code☆48Jun 28, 2024Updated last year
- Limit Orderbook Replay/Analysis Library☆10Nov 19, 2018Updated 7 years ago
- A tool to help you to copy an AMI from your Worldwide AWS account to China account.☆11Sep 16, 2023Updated 2 years ago
- Workshop for Model Context Protocol☆17Mar 27, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Cyclone Jet Rocket is a DDoS tool for System Security Technology course☆11Jun 5, 2017Updated 8 years ago
- ☆12Mar 1, 2024Updated 2 years ago
- Improving langchain knowledge graphs using baml☆43Aug 3, 2025Updated 8 months ago
- A scalable inference server for models optimized with OpenVINO™☆855Updated this week
- Simplifying RAG with PostgreSQL and PGVector☆16Jul 31, 2024Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- An LLM extension for PowerToys Command Palette☆61Jun 21, 2025Updated 9 months ago