shkim0116 / KLASS
[NeurIPS 2025 Spotlight] Implementation of "KLASS: KL-Adaptive Stability Sampling for Fast Inference in Masked Diffusion Models"
☆17Updated last month
Alternatives and similar repositories for KLASS
Users who are interested in KLASS are comparing it to the repositories listed below.
- [ICLR 2025] Official PyTorch implementation of "DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation"☆25Updated 5 months ago
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆42Updated 3 months ago
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"…☆29Updated 2 weeks ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…☆46Updated last year
- GitHub repository for "Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging" (ICML 2025)☆86Updated 2 months ago
- Official PyTorch implementation of DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs (ICML 2025 Oral)☆51Updated 5 months ago
- Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks…☆41Updated last year
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆23Updated 9 months ago
- [AAAI 2025] Official Implementation of I-HallA v1.0☆13Updated 10 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆72Updated 9 months ago
- Model Stock: All we need is just a few fine-tuned models☆127Updated 4 months ago
- CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning (NeurIPS 2024)☆53Updated 10 months ago
- Code release for "Generative Modeling of Weights: Generalization or Memorization?"☆17Updated 6 months ago
- ☆13Updated 8 months ago
- Official Implementation of "The Role of Masking for Efficient Supervised Knowledge Distillation of Vision Transformers" (ECCV 2024)☆25Updated 10 months ago
- Reproduction of DeepSeek-R1☆244Updated 7 months ago
- DoctorAgent-RL: A Multi-Agent Collaborative Reinforcement Learning System for Multi-Turn Clinical Dialogue☆45Updated last month
- ☆108Updated 8 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆81Updated last month
- MedSafetyBench: Evaluating and Improving the Medical Safety of LLMs, NeurIPS 2024☆36Updated last week
- Sparse autoencoders for vision☆52Updated this week
- dParallel: Learnable Parallel Decoding for dLLMs☆44Updated last month
- [NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method☆85Updated 2 weeks ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆187Updated last year
- ☆185Updated 6 months ago
- ☆76Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆107Updated 2 years ago
- Preference Learning for LLaVA☆57Updated last year
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆48Updated 2 weeks ago
- KAIST medical VL research group☆19Updated 11 months ago