chenxingqiang / YiRageLinks
YiRage (Yield Revolutionary AGile Engine) - Multi-Backend LLM Inference Optimization. Extends Mirage with comprehensive support for CUDA, MPS, CPU, Triton, NKI, cuDNN, and MKL backends.
☆37Updated this week
Alternatives and similar repositories for YiRage
Users that are interested in YiRage are comparing it to the libraries listed below
Sorting:
- Official implementation of "REASONING COMPILER: LLM-Guided Optimizations for Efficient Model Serving" (NeurIPS 2025)☆97Updated 3 weeks ago
- A Tiny structure of pytorch for learning;☆60Updated last year
- ☆123Updated last week
- Butter is a novel 2D object detection framework designed to enhance hierarchical feature representations for improved detection robustnes…☆85Updated 4 months ago
- A toolkit enhances PyTorch with specialized functions for low-bit quantized neural networks.☆196Updated last year
- 4th Place Solution for the Kaggle Competition: LMSYS - Chatbot Arena Human Preference Predictions☆171Updated last year
- Step-by-step optimization of TPU MatMul Kernels☆85Updated 5 months ago
- This is the code for Visual Reasoning Sequential Attack, which is a method to jailbreak Multimodal Large Language Models Based on their v…☆64Updated 3 weeks ago
- ☆135Updated last year
- Code for "FaithLens: Detecting and Explaining Faithfulness Hallucination"☆54Updated this week
- ☆143Updated last year
- ☆24Updated last year
- Hands-on construction of a complete neural network☆14Updated 2 years ago
- ☆279Updated 7 months ago
- Desktop Tiny Agent is a lightweight, modular desktop intelligent agent framework. It offers plugin extensibility, task scheduling (sync/a…☆80Updated 4 months ago
- ☆137Updated last year
- A python package that integrate algorithms and various machine learning approaches to extract features (genes) effective for classificati…☆252Updated last year
- slark is a cross platform player that supports iOS and Android☆98Updated this week
- mini-webui delivers a streamlined AI chat console for teams that need rapid iteration, reliable integrations, and production-ready guardr…☆44Updated last month
- Example project using universal links as deeplinks to switch iOS apps.☆13Updated last year
- Code and dataset of ARMOUR: zero-permission sensor usage (ACM WiSec 2025)☆38Updated 6 months ago
- The code for TPAMI paper "Text-Guided Human Image Manipulation via Image-Text Shared Space"☆86Updated 3 years ago
- A pytorch implementation of the paper "TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Simi…☆343Updated 2 weeks ago
- [CVPR 2025] MDP: Multidimensional Vision Model Pruning with Latency Constraint☆169Updated 3 months ago
- ☆30Updated 10 months ago
- (TIP 2022) Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction☆109Updated 9 months ago
- Adapt Diffusion Models to Multi-frame interpolation☆31Updated last week
- 采集管家☆313Updated 6 months ago
- Dataset and evaluation code of ISDrama(ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting☆236Updated 4 months ago
- ☆38Updated 8 months ago