william-sto / JusticeNeverTooLateLinks
字节跳动瓜最终真实情况,用事实说话,正义会迟到但不会缺席!
☆24Updated 9 months ago
Alternatives and similar repositories for JusticeNeverTooLate
Users that are interested in JusticeNeverTooLate are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆29Updated last month
- ☆113Updated 2 months ago
- EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE☆11Updated last year
- Open-Pandora: On-the-fly Control Video Generation☆34Updated 7 months ago
- My Curriculum Vitae☆62Updated 3 years ago
- Self Reproduction Code of Paper "Reducing Transformer Key-Value Cache Size with Cross-Layer Attention (MIT CSAIL)☆17Updated last year
- Using message app/bot to notify you when doing time-consuming tasks. Bake your experiments!☆74Updated last week
- Accelerate LLM preference tuning via prefix sharing with a single line of code☆42Updated 2 weeks ago
- Official code for ICLR 2024 paper "Do Generated Data Always Help Contrastive Learning?"☆31Updated last year
- Examples and instructions about use LLMs (especially ChatGPT) for PhD☆108Updated 2 years ago
- ☆77Updated 4 months ago
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆72Updated 2 months ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆128Updated this week
- Guide for surviving at UIUC (under development)☆69Updated last month
- [NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification☆23Updated 3 months ago
- ☆31Updated 3 months ago
- A collection of papers on discrete diffusion models☆152Updated 2 weeks ago
- A happy way for research!☆23Updated 2 years ago
- ☆88Updated last month
- ☆37Updated 2 months ago
- A Telegram bot to recommend arXiv papers☆276Updated 3 months ago
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆27Updated last month
- [ICLR 2025] Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models☆118Updated last week
- MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images☆39Updated last month
- ☆75Updated last week
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆77Updated 5 months ago
- The blog, read report and code example for AGI/LLM related knowledge.☆40Updated 5 months ago
- ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression☆46Updated last month
- Compiler-R1: Towards Agentic Compiler Auto-tuning with Reinforcement Learning☆12Updated last week
- Collected the world's best computer vision labs and lecture materials.☆14Updated 4 months ago