Kinetics: Rethinking Test-Time Scaling Laws
β87Jul 11, 2025Updated 11 months ago
Alternatives and similar repositories for Kinetics
Users that are interested in Kinetics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NAACL'25 π SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expertβ¦β16Feb 4, 2025Updated last year
- Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy"β15Mar 6, 2025Updated last year
- β36Mar 12, 2025Updated last year
- [ICMLβ25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training anβ¦β13Apr 17, 2025Updated last year
- Vortex: Programmable Sparse Attention for Agents as Algorithm Designersβ62Jun 8, 2026Updated 3 weeks ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code for "Reasoning to Learn from Latent Thoughts"β131Mar 28, 2025Updated last year
- β34Oct 13, 2025Updated 8 months ago
- [ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparselyβ24Jun 26, 2024Updated 2 years ago
- [ACL 2025 main] FR-Spec: Frequency-Ranked Speculative Samplingβ55Jul 15, 2025Updated 11 months ago
- β65Jun 12, 2025Updated last year
- [ICMLβ2024] "LoCoCo: Dropping In Convolutions for Long Context Compression", Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chenβ17Sep 7, 2024Updated last year
- [ECCV 2022 Oral] AutoMix: Unveiling the Power of Mixup for Stronger Classifiersβ18Apr 25, 2023Updated 3 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encodersβ19May 23, 2025Updated last year
- Official Implementation of APB (ACL 2025 main Oral) and Spava (ACL 2026 main).β37Apr 6, 2026Updated 2 months ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement β¦β44Aug 6, 2025Updated 10 months ago
- LLM Inference with Microscaling Formatβ34Nov 12, 2024Updated last year
- Sirius, an efficient correction mechanism, which significantly boosts Contextual Sparsity models on reasoning tasks while maintaining itsβ¦β21Sep 10, 2024Updated last year
- β101May 29, 2026Updated 3 weeks ago
- [ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generationβ254Dec 16, 2024Updated last year
- β123May 19, 2025Updated last year
- SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUsβ67Mar 25, 2025Updated last year
- [ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inferenceβ397Jul 10, 2025Updated 11 months ago
- Cascade Speculative Draftingβ33Apr 2, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoringβ280Jul 6, 2025Updated 11 months ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversityβ22Aug 28, 2025Updated 10 months ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.β21Apr 3, 2025Updated last year
- [ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inferenceβ308May 1, 2025Updated last year
- Measuring Thinking Efficiency in Reasoning Models - Research Repositoryβ39Dec 2, 2025Updated 6 months ago
- β81Jun 8, 2026Updated 3 weeks ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yongaβ¦β16Jan 3, 2022Updated 4 years ago
- [COLM 2025] Official PyTorch implementation of "Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models"β76Jul 8, 2025Updated 11 months ago
- β25Apr 10, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- β39Feb 26, 2024Updated 2 years ago
- β62May 19, 2025Updated last year
- This repo contains the source code for: Model Tells You What to Discard: Adaptive KV Cache Compression for LLMsβ43Aug 14, 2024Updated last year
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"β15Jul 24, 2024Updated last year
- β14Oct 3, 2024Updated last year
- Continuous Pipelined Speculative Decodingβ20May 25, 2026Updated last month
- β114Aug 26, 2024Updated last year