linkedlist771 / UCAS-MOOC-AutoWatch
☆21 Updated last year
Alternatives and similar repositories for UCAS-MOOC-AutoWatch
Users that are interested in UCAS-MOOC-AutoWatch are comparing it to the libraries listed below
- Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing ☆55 Updated 7 months ago
- Advanced Computer Architecture 2020, taught by Wu Junmin, USTC graduate course ☆71 Updated last year
- AI Computing Systems, Chen Yunji ☆167 Updated 2 years ago
- This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co… ☆187 Updated last month
- Lab assignments for the UCAS course "AI Computing Systems" ☆23 Updated last year
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24) ☆43 Updated 8 months ago
- ☆47 Updated 10 months ago
- ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression (DAC'25) ☆14 Updated 2 months ago
- Fairy±i (iFairy): Complex-valued Quantization Framework for Large Language Models ☆99 Updated last week
- ☆15 Updated 5 months ago
- [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning ☆101 Updated last year
- A Manual on Surviving in CS of NWPU ☆52 Updated 2 years ago
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank ☆58 Updated 10 months ago
- Spring 2021-22 AI Computing Systems labs, University of Chinese Academy of Sciences ☆33 Updated 3 years ago
- ☆77 Updated 10 months ago
- Code release for AdapMoE accepted by ICCAD 2024 ☆32 Updated 4 months ago
- ☆55 Updated last year
- Study aid tool for UCAS English MOOCs ☆11 Updated last year
- Introduction to Computer Systems (II), Spring 2021 ☆51 Updated 4 years ago
- Automatic course evaluation tool for UCAS. ☆14 Updated last year
- ☆131 Updated last month
- A repository sharing the literature about large language models ☆100 Updated last month
- Pytorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference ☆42 Updated last year
- Curated collection of papers in MoE model inference ☆250 Updated last month
- LLCL-MIPS is a superscalar MIPS processor, which supports MIPS Release 1 instructions and is capable of booting the Linux kernel. (5th Loongson Cup grand prize… ☆37 Updated 3 years ago
- HPC-Lab for High Performance Computing course, 2023 Spring, Tsinghua University. Introduction to High Performance Computing @ THU. ☆25 Updated 2 years ago
- analyse problems of AI with Math and Code ☆21 Updated last month
- Adaptive Attention Sparsity with Hierarchical Top-p Pruning ☆19 Updated 6 months ago
- ☆143 Updated 2 months ago
- [DAC 2024] EDGE-LLM: Enabling Efficient Large Language Model Adaptation on Edge Devices via Layerwise Unified Compression and Adaptive La… ☆65 Updated last year