neilwen987 / CSR_Adaptive_RepLinks
Official Code for Paper: Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation
☆133Updated last month
Alternatives and similar repositories for CSR_Adaptive_Rep
Users that are interested in CSR_Adaptive_Rep are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆175Updated 4 months ago
- ☆236Updated last week
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆99Updated last week
- LLaDA2.0 is the diffusion language model series developed by InclusionAI team, Ant Group.☆240Updated last month
- [ICLR 2026] Geometric-Mean Policy Optimization☆99Updated 2 weeks ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last week
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆50Updated this week
- Easy and Efficient dLLM Fine-Tuning☆209Updated 2 weeks ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆120Updated last month
- Reinforcement Learning via Self-Distillation (SDPO)☆285Updated this week
- AnchorAttention: Improved attention for LLMs long-context training☆213Updated last year
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆32Updated last year
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆52Updated last year
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆75Updated 7 months ago
- ☆91Updated last year
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆57Updated 8 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆154Updated 7 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction