CatworldLee / Gaussian-Mixture-Mask-Attention
☆8Updated 6 months ago
Alternatives and similar repositories for Gaussian-Mixture-Mask-Attention
Users that are interested in Gaussian-Mixture-Mask-Attention are comparing it to the libraries listed below
Sorting:
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆18Updated 4 months ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆29Updated 7 months ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels"☆14Updated 3 months ago
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆20Updated 9 months ago
- Official implementation for "Knowledge Distillation with Refined Logits".☆13Updated 8 months ago
- ☆13Updated 8 months ago
- Official implementation of the WACV 2025 paper "3D Part Segmentation via Geometric Aggregation of 2D Visual Features"☆18Updated 2 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Updated 10 months ago
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…☆38Updated last year
- ☆24Updated last year
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆20Updated last year
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated 2 months ago
- Moved to https://github.com/NUS-HPC-AI-Lab/InfoBatch☆6Updated last year
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"☆13Updated 7 months ago
- ☆41Updated 6 months ago
- The official implementation of the paper "Asymmetric Polynomial Loss for Multi-Label Classification"(ICASSP 2023)☆21Updated 2 years ago
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"☆12Updated 11 months ago
- Control LLM☆14Updated last month
- Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆14Updated 5 months ago
- Code for the paper: "POET: Prompt Offset Tuning for Continual Human Action Adaptation" (ECCV 2024, Oral)☆11Updated 3 weeks ago
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Updated 6 months ago
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆12Updated last month
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆12Updated 2 months ago
- A token pruning method that accelerates ViTs for various tasks while maintaining high performance.☆12Updated 4 months ago
- ☆17Updated 9 months ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆19Updated last year
- ☆16Updated 9 months ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆17Updated 10 months ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆20Updated last year
- Official repo of Progressive Data Expansion: data, code and evaluation☆29Updated last year