MeganTj / multimodal_alignment
☆11Updated 3 weeks ago
Alternatives and similar repositories for multimodal_alignment:
Users that are interested in multimodal_alignment are comparing it to the libraries listed below
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated last month
- ☆18Updated 9 months ago
- Official PyTorch Implementation for Task Vectors are Cross-Modal☆22Updated 4 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆25Updated 5 months ago
- Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆14Updated 5 months ago
- HGRN2: Gated Linear RNNs with State Expansion☆54Updated 8 months ago
- Holistic evaluation of multimodal foundation models☆47Updated 8 months ago
- ☆18Updated 5 months ago
- PyTorch implementation of StableMask (ICML'24)☆12Updated 9 months ago
- ☆31Updated 3 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆9Updated 9 months ago
- ☆25Updated 6 months ago
- I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024)☆19Updated 5 months ago
- Project for SNARE benchmark☆11Updated 10 months ago
- ☆10Updated 5 months ago
- ☆14Updated 3 months ago
- Official Code for ACL 2023 Outstanding Paper: World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Languag…☆32Updated last year
- DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding☆49Updated 3 weeks ago
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆20Updated 8 months ago
- ☆41Updated 5 months ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆18Updated 9 months ago
- Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry"☆13Updated last year
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆19Updated last year
- ☆17Updated 3 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Updated last year
- [ICLR'25] "Understanding Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing" by Peihao Wang, Ruisi Cai, Yue…☆11Updated last month
- Accompanies the EMNLP 2024 paper: "Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions". This repo featur…☆19Updated 2 months ago
- MIO: A Foundation Model on Multimodal Tokens☆25Updated 4 months ago
- More dimensions = More fun☆22Updated 8 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆20Updated 5 months ago