Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations using ipex-llm
☆171Apr 29, 2025Updated last year
Alternatives and similar repositories for ipex-llm-tutorial
Users that are interested in ipex-llm-tutorial are comparing it to the libraries listed below.
- Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,…☆8,794Jan 28, 2026Updated 3 months ago
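- This project uses the PaddlePaddle platform to build an innovative multi-model collaborative system for efficient, accurate conversion of PDF files to Markdown files.☆27Mar 25, 2025Updated last year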
- Step-by-step Deep Learning Tutorials on Apache Spark using BigDL☆210Jan 3, 2023Updated 3 years ago
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆576Apr 24, 2026Updated last week
- ☆13Oct 28, 2020Updated 5 years ago
- 📚 Jupyter notebook tutorials for OpenVINO™☆3,119Updated this week
- This is Microsoft-Phi-3-NvidiaNIMWorkshop☆22Aug 16, 2024Updated last year
- A Gradio Web UI for running local LLM on Intel GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) using IPEX-LLM.☆17Apr 26, 2026Updated last week
- ☆15May 17, 2024Updated last year
- [ICML2025] KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference☆28Jan 27, 2026Updated 3 months ago
- [NAACL 2025] MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning☆19May 31, 2025Updated 11 months ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- ☆17Dec 16, 2024Updated last year
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- Botticelli is an open-source .NET Core framework for building universal chatbots. It enables seamless integration with databases, queue b…☆15Mar 14, 2026Updated last month
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).☆10Dec 8, 2022Updated 3 years ago
- xeCJK usage examples, explained and analyzed☆14Feb 27, 2020Updated 6 years ago
- This repository contains resources, documentation and artifacts describing LLM agents☆15Jan 22, 2025Updated last year
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated last year
- This course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust…☆14Feb 13, 2024Updated 2 years ago
- ☆14Apr 22, 2024Updated 2 years ago
- python quant☆50Aug 24, 2023Updated 2 years ago
- Python scraper for trending searches on Weibo, Bilibili, Baidu, and Baidu Tieba☆12Mar 1, 2026Updated 2 months ago
- Kexplain is an interactive kubectl explain☆12Oct 23, 2023Updated 2 years ago
- An inter-process communication library built on ZeroMQ, supporting topic-filtered publish/subscribe and RPC communication modes☆15Feb 2, 2023Updated 3 years ago
- A modern, single-page web chat interface for local LLMs (Large Language Models), inspired by the visual style and UX of Anthropic's Claud…☆32May 11, 2025Updated 11 months ago
- With OpenVINO Test Drive, users can run large language models (LLMs) and models trained by Intel Geti on their devices, including AI PCs …☆37Mar 12, 2026Updated last month
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- Tools for easier OpenVINO development/debugging☆10Jul 16, 2025Updated 9 months ago
- An open-source tool created by OctoML that converts TVM-optimized models to code runnable in ONNX Runtime.☆17Mar 30, 2023Updated 3 years ago
- Synthetic data for fine tuning LLM☆27Dec 26, 2024Updated last year
- Building Llama 3 from scratch using PyTorch☆13Sep 1, 2024Updated last year
- Baidu Tieba scraper/crawler | Tieba crawler: extracts the data of an entire forum | Part of Internet Archiving Initiative "Project Ex Nihilo"☆19Feb 12, 2024Updated 2 years ago
- Building reliable Retrieval Augmented Generation (RAG) AI Architecture☆13Jul 30, 2024Updated last year
- Limit Orderbook Replay/Analysis Library☆10Nov 19, 2018Updated 7 years ago
- Microservice SIG is committed to providing a standardized service governance solution for distributed and microservice architecture☆35Feb 4, 2023Updated 3 years ago
- ProxQuant: Quantized Neural Networks via Proximal Operators☆30Feb 19, 2019Updated 7 years ago
- Graphical user interface for tensor networks☆13Jul 27, 2020Updated 5 years ago