pixeli99 / Prophet
Official implementation of "Diffusion Language Models Know the Answer Before Decoding"
☆29Updated 2 weeks ago
Alternatives and similar repositories for Prophet
Users that are interested in Prophet are comparing it to the libraries listed below
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Updated 5 months ago
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆74Updated last week
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆124Updated 2 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆46Updated 2 months ago
- ☆44Updated 3 months ago
- The official code implementation for paper "R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing"☆45Updated 2 weeks ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆92Updated 3 weeks ago
- ☆24Updated 4 months ago
- Implementation and dataset for paper "Can MLLMs Perform Text-to-Image In-Context Learning?"☆41Updated 3 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆35Updated 2 months ago
- VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆52Updated 2 months ago
- ☆55Updated 3 months ago
- This is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆39Updated 11 months ago
- [NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models☆96Updated 4 months ago
- Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]☆38Updated 2 months ago
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆20Updated 6 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆86Updated 11 months ago
- AnchorAttention: Improved attention for LLMs long-context training☆212Updated 8 months ago
- [ICLR 2025] Official PyTorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆26Updated last month
- The official implementation for Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free☆54Updated 4 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆36Updated 2 months ago
- ☆17Updated 8 months ago
- ☆100Updated this week
- Code for Heima☆53Updated 5 months ago
- Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasonin…☆39Updated 2 weeks ago
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models☆48Updated 4 months ago
- Extending context length of visual language models☆12Updated 9 months ago
- ☆34Updated 4 months ago
- A Collection of Papers on Diffusion Language Models☆126Updated last week
- ☆227Updated this week