Unparalleled-Calvin / Fudan-course-searchLinks
☆10Updated 4 years ago
Alternatives and similar repositories for Fudan-course-search
Users that are interested in Fudan-course-search are comparing it to the libraries listed below
Sorting:
- ICS_2020_PJ☆10Updated 4 years ago
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆43Updated 4 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆114Updated 5 months ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆174Updated 2 months ago
- 字节跳动瓜最终真实情况,用事实说话,正义会迟到但不会缺席!☆24Updated last year
- Course Website for ICS Spring 2020 at Fudan University https://sunfloweraries.github.io/ICS-Spring20-Fudan/☆12Updated 5 years ago
- A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention☆212Updated 2 months ago
- Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆74Updated 3 months ago
- ⚡ Bring some magic to i.sjtu.edu.cn☆22Updated 5 years ago
- ☆34Updated 5 months ago
- [TMLR 2025] Efficient Diffusion Models: A Survey☆124Updated 4 months ago
- Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference☆185Updated last month
- ☆78Updated last year
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆32Updated 11 months ago
- USTC-Learning☆17Updated 6 years ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆116Updated last year
- [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring☆245Updated 4 months ago
- [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression☆49Updated this week
- [CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers☆70Updated last year
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity☆61Updated 4 months ago
- Openreviewers: Multi Agent Academic Review Simulation System☆22Updated last year
- A collection of papers on discrete diffusion models☆166Updated 4 months ago
- Curated list of methods that focuses on improving the efficiency of diffusion models☆44Updated last year
- toy reproduction of Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts☆24Updated last year
- Course notes for Cyber Security (THUCST 2023 Spring)☆29Updated 2 years ago
- (ToCa-v2) A New version of ToCa,with faster speed and better acceleration!☆38Updated 7 months ago
- Official implementation of "DPad: Efficient Diffusion Language Models with Suffix Dropout"☆52Updated 2 months ago
- Paper survey of efficient computation for large scale models.☆34Updated 11 months ago
- A sparse attention kernel supporting mix sparse patterns☆355Updated 8 months ago
- [NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Tok…☆54Updated 2 weeks ago