MikaStars39 / StableMaskLinks
PyTorch implementation of StableMask (ICML'24)
☆13Updated 11 months ago
Alternatives and similar repositories for StableMask
Users that are interested in StableMask are comparing it to the libraries listed below
Sorting:
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆38Updated 8 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆38Updated 3 weeks ago
- Open-Pandora: On-the-fly Control Video Generation☆34Updated 6 months ago
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆69Updated 2 weeks ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆35Updated last week
- Preference Learning for LLaVA☆46Updated 7 months ago
- ☆85Updated 2 months ago
- [NeurIPS 2023] Make Your Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning☆31Updated 2 years ago
- [EMNLP 2023]Context Compression for Auto-regressive Transformers with Sentinel Tokens☆24Updated last year
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"☆30Updated last month
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆88Updated last month
- [MM2024, oral] "Self-Supervised Visual Preference Alignment" https://arxiv.org/abs/2404.10501☆55Updated 10 months ago
- ☆51Updated 3 months ago
- Official Repository of LatentSeek☆48Updated 2 weeks ago
- ☆42Updated 7 months ago
- ☆17Updated 5 months ago
- ☆50Updated last year
- (ICLR2025 Spotlight) DEEM: Official implementation of Diffusion models serve as the eyes of large language models for image perception.☆34Updated 3 months ago
- ☆29Updated last year
- ☆18Updated last month
- [ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding☆50Updated 6 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 8 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆44Updated 3 months ago
- The official implementation for Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free☆44Updated last month
- [SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di…☆54Updated 7 months ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆34Updated 2 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆57Updated 8 months ago
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆39Updated last year
- ☆78Updated 5 months ago
- ☆15Updated 2 months ago