czg1225 / VeriThinkerLinks
VeriThinker: Learning to Verify Makes Reasoning Model Efficient
☆38Updated last week
Alternatives and similar repositories for VeriThinker
Users that are interested in VeriThinker are comparing it to the libraries listed below
Sorting:
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆60Updated last week
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models☆40Updated 3 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆31Updated 3 months ago
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality"☆47Updated 2 months ago
- Fast-Slow Thinking for Large Vision-Language Model Reasoning☆14Updated last month
- Code for CVPR 2024 Oral "Neural Lineage"☆17Updated 11 months ago
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?☆49Updated this week
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Updated 7 months ago
- Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆53Updated last week
- Adapting LLaMA Decoder to Vision Transformer☆28Updated last year
- ☆111Updated last week
- ☆74Updated 2 weeks ago
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training☆45Updated 2 months ago
- [CVPR] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization☆24Updated 2 months ago
- ☆81Updated 2 months ago
- ☆84Updated 2 months ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆44Updated 3 months ago
- Autoregressive Image Generation with Randomized Parallel Decoding☆63Updated 2 months ago
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆117Updated 2 weeks ago
- ✈️ Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints☆67Updated 2 months ago
- GIFT: Generative Interpretable Fine-Tuning☆20Updated 7 months ago
- Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models☆22Updated this week
- ☆25Updated last month
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆102Updated 2 months ago
- ☆32Updated 3 weeks ago
- Vico: Compositional Video Generation as Flow Equalization☆58Updated 6 months ago
- ☆16Updated 5 months ago
- Data distillation benchmark☆64Updated this week
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models☆27Updated last month
- [ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆36Updated 3 months ago