Official repository of "Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions", ICLR 2024 Spotlight
☆21Mar 7, 2024Updated last year
Alternatives and similar repositories for Instructive-Decoding
Users that are interested in Instructive-Decoding are comparing it to the libraries listed below
Sorting:
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)☆65Sep 28, 2024Updated last year
- Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025)☆19Jan 17, 2026Updated last month
- (CVPR 2024) Official Implementation of "FedSOL: Stabilized Orthogonal Learning with Proximal Restrictions in Federated Learning"☆15Jun 28, 2024Updated last year
- Code for CIKM 2021 best short paper nomination "Modeling Sequences as Distributions with Uncertainty for Sequential Recommendation" https…☆16Jun 11, 2021Updated 4 years ago
- Multi-head Recurrent Layer Attention for Vision Network☆22Mar 2, 2023Updated 2 years ago
- ☆20Dec 24, 2024Updated last year
- Beyond KV Caching: Shared Attention for Efficient LLMs☆20Jul 19, 2024Updated last year
- Triton implement of bi-directional (non-causal) linear attention☆68Updated this week
- Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024).☆25Updated this week
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆26Apr 15, 2025Updated 10 months ago
- Addressing the problem of predicting crime occurrence based on historic records☆11Nov 27, 2019Updated 6 years ago
- ☆32Mar 9, 2022Updated 3 years ago
- The repository includes evidence that a published paper in TKDE shares surprisingly high similarity to our paper.☆32Dec 18, 2022Updated 3 years ago
- [ICLR2023] Towards Understanding and Mitigating Dimensional Collapse in Heterogeneous Federated Learning (https://arxiv.org/abs/2210.0022…☆40Jan 30, 2023Updated 3 years ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- ☆13Jun 18, 2025Updated 8 months ago
- ☆11Feb 19, 2022Updated 4 years ago
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆49Jun 17, 2025Updated 8 months ago
- Official code for the paper "Attention as a Hypernetwork"☆48Jun 22, 2024Updated last year
- FocusLLM: Scaling LLM’s Context by Parallel Decoding☆44Dec 8, 2024Updated last year
- Code for AAAI21 paper "Scalable and Explainable 1-Bit Matrix Completion via Graph Signal Learning"☆11Feb 15, 2022Updated 4 years ago
- ☆11Jul 20, 2021Updated 4 years ago
- The official PyTorch implementation of "An Attentional Multi-scale Co-evolving Model for Dynamic Link Prediction" (TheWebConf'23)☆11May 4, 2023Updated 2 years ago
- Repository for the DPP'23 course☆11May 2, 2024Updated last year
- Graphical intuition to MOSFET square-law☆11Jan 5, 2021Updated 5 years ago
- Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation"☆12Feb 8, 2023Updated 3 years ago
- "SCONE: A Novel Stochastic Sampling to Generate Contrastive Views and Hard Negative Samples for Recommendation", WSDM 2025☆14Nov 25, 2025Updated 3 months ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 4 months ago
- Generic library for neural collapse and several derivative works on the phenomenon.☆18Apr 14, 2025Updated 10 months ago
- Vision Transformer (ViT) models, with their attention mechanisms, revolutionized computer vision. By merging Class Activation Map (CAM) a…☆13Aug 14, 2023Updated 2 years ago
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?"(published in TMLR)☆12Jul 10, 2022Updated 3 years ago
- Mitigating the Filter Bubble while Maintaining Relevance: Targeted Diversification with VAE-based Recommender Systems☆10Mar 15, 2023Updated 2 years ago
- ☆20Aug 8, 2025Updated 6 months ago
- This is the code of paper: Robust Mid-Pass Filtering Graph Convolutional Networks.(paper accepted by WWW2023)☆13Feb 17, 2023Updated 3 years ago
- ☆11Jan 7, 2025Updated last year
- ☆47Nov 8, 2024Updated last year
- ☆47Mar 14, 2025Updated 11 months ago
- Allen-Cahn Equation☆15Feb 20, 2023Updated 3 years ago
- dMel: Speech Tokenization Made Simple☆16May 13, 2025Updated 9 months ago