Tonanguyxiro / HKUST-GZ_RBM_Research_ProposalLinks
☆26Updated last year
Alternatives and similar repositories for HKUST-GZ_RBM_Research_Proposal
Users that are interested in HKUST-GZ_RBM_Research_Proposal are comparing it to the libraries listed below
Sorting:
- ☆49Updated 11 months ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆25Updated 2 years ago
- Facilitating selective network routing for Ivanti-connected devices to a school's network, using port forwarding for enhanced access cont…☆13Updated last year
- Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy"☆14Updated 3 months ago
- ChatEDA: A Large Language Model Powered Autonomous Agent for EDA☆25Updated 2 weeks ago
- ☆14Updated last year
- ☆45Updated 7 months ago
- [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning☆93Updated 9 months ago
- Polynormer: Polynomial-Expressive Graph Transformer in Linear Time☆40Updated last year
- Chinese Translation for Philip Guo's The PhD Grind☆80Updated 2 years ago
- The official implementation of WSDM'24 paper <DeSCo: Towards Generalizable and Scalable Deep Subgraph Counting>☆19Updated last year
- analyse problems of AI with Math and Code☆16Updated last week
- ☆49Updated 8 months ago
- Differentiable Combinatorial Scheduling at Scale (ICML'24). Mingju Liu, Yingjie Li, Jiaqi Yin, Zhiru Zhang, Cunxi Yu.☆21Updated 7 months ago
- Codebase for ICML'24 paper: Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs☆26Updated 11 months ago
- Code release for AdapMoE accepted by ICCAD 2024☆26Updated last month
- SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs☆47Updated 2 months ago
- Adaptive Attention Sparsity with Hierarchical Top-p Pruning☆17Updated 3 months ago
- An Optimizing Framework on MLIR for Efficient FPGA-based Accelerator Generation☆45Updated last year
- Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**.☆28Updated 2 years ago
- The official implementation of the DAC 2024 paper GQA-LUT☆18Updated 5 months ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆18Updated 2 years ago
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆61Updated 2 years ago
- [DAC'25] Official implement of "HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference"☆51Updated 2 weeks ago
- ☆43Updated 2 years ago
- FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation [Efficient ML Model]☆21Updated last week
- Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits☆26Updated 9 months ago
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆30Updated 6 months ago
- CircuitFusion: Multimodal Circuit Representation Learning for Agile Chip Design (ICLR'25)☆13Updated last month
- Openreviewers: Multi Agent Academic Review Simulation System☆20Updated last year