william-sto / JusticeNeverTooLate
The final, true story behind the ByteDance gossip: let the facts speak. Justice may arrive late, but it will never be absent!
☆24Updated last year
Alternatives and similar repositories for JusticeNeverTooLate
Users who are interested in JusticeNeverTooLate are comparing it to the repositories listed below
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆43Updated 4 months ago
- A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention☆212Updated 2 months ago
- Openreviewers: Multi Agent Academic Review Simulation System☆22Updated last year
- [NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification☆29Updated 7 months ago
- ☆35Updated 7 months ago
- Using message app/bot to notify you when doing time-consuming tasks. Bake your experiments!☆80Updated 2 weeks ago
- ☆28Updated 3 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆114Updated 5 months ago
- ☆27Updated last month
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆174Updated 2 months ago
- Open-Pandora: On-the-fly Control Video Generation☆35Updated 11 months ago
- A sparse attention kernel supporting mix sparse patterns☆355Updated 8 months ago
- Tsinghua University Feiyue Database☆31Updated last week
- This repository provides a comprehensive library for parallel training and LoRA algorithm implementations, supporting multiple parallel s…☆50Updated this week
- A happy way for research!☆23Updated 2 years ago
- [ICLR 2025] Mixture Compressor for Mixture-of-Experts LLMs Gains More☆61Updated 8 months ago
- Course notes for Cyber Security (THUCST 2023 Spring)☆29Updated 2 years ago
- Code for Draft Attention☆92Updated 5 months ago
- VideoNSA: Native Sparse Attention Scales Video Understanding☆54Updated last week
- SuperDebug: debugging made this simple!☆17Updated 3 years ago
- [NeurIPS'25] The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Tok…☆54Updated 2 weeks ago
- Official implementation of "DPad: Efficient Diffusion Language Models with Suffix Dropout"☆52Updated 2 months ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆53Updated 7 months ago
- Official PyTorch implementation of the paper "Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Princ…☆34Updated 3 months ago
- [arxiv 2025] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning☆33Updated last week
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity☆61Updated 4 months ago
- >>> Exceptions and interrupts + virtual-memory page tables + branch prediction + TLB + Cache + Flash + VGA + uCore☆18Updated last year
- Collected the world's best computer vision labs and lecture materials.☆14Updated 8 months ago
- Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆74Updated 3 months ago
- The blog, read report and code example for AGI/LLM related knowledge.☆48Updated 9 months ago