william-sto / JusticeNeverTooLate
The final, true account of the ByteDance gossip: let the facts speak. Justice may be late, but it will never be absent!
☆24 Updated 9 months ago
Alternatives and similar repositories for JusticeNeverTooLate
Users who are interested in JusticeNeverTooLate are comparing it to the repositories listed below.
- Self-reproduction code for the paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention" (MIT CSAIL) ☆17 Updated last year
- EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE ☆11 Updated last year
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling ☆38 Updated 6 months ago
- Exploring Diffusion Transformer Designs via Grafting ☆48 Updated last month
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache… ☆134 Updated this week
- ☆89 Updated 2 months ago
- [NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification ☆23 Updated 4 months ago
- This repository provides a comprehensive library for parallel training and LoRA algorithm implementations, supporting multiple parallel s… ☆48 Updated this week
- Openreviewers: Multi Agent Academic Review Simulation System ☆20 Updated last year
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference ☆31 Updated last month
- ☆194 Updated last week
- siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems ☆160 Updated this week
- Using message app/bot to notify you when doing time-consuming tasks. Bake your experiments! ☆74 Updated last month
- ☆32 Updated 4 months ago
- ☆79 Updated 5 months ago
- Course notes for Cyber Security (THUCST 2023 Spring) ☆30 Updated 2 years ago
- Open-Pandora: On-the-fly Control Video Generation ☆34 Updated 8 months ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality… ☆51 Updated 4 months ago
- ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression ☆46 Updated 2 months ago
- ☆37 Updated 2 months ago
- A happy way for research! ☆23 Updated 2 years ago
- What are learned in tiktoken? ☆69 Updated last year
- An efficient implementation of the NSA (Native Sparse Attention) kernel ☆110 Updated last month
- A Telegram bot to recommend arXiv papers ☆281 Updated 4 months ago
- A collection of papers on discrete diffusion models ☆156 Updated last month
- ✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints ☆72 Updated last month
- ☆120 Updated 2 months ago
- VeriThinker: Learning to Verify Makes Reasoning Model Efficient ☆49 Updated last month
- Code release for paper "Test-Time Training Done Right" ☆249 Updated 3 weeks ago
- [ICLR 2025] Mixture Compressor for Mixture-of-Experts LLMs Gains More ☆48 Updated 6 months ago