BryceZhuo / PolyCom
The official implementation of Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models.
☆11Updated last month
Alternatives and similar repositories for PolyCom:
Users that are interested in PolyCom are comparing it to the libraries listed below
- Official implementation of ECCV24 paper: POA☆24Updated 5 months ago
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆25Updated 2 months ago
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models☆15Updated last week
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"☆18Updated 2 months ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆13Updated this week
- Official PyTorch Implementation for Task Vectors are Cross-Modal☆21Updated last month
- Efficient Mixture of Experts for LLM Paper List☆26Updated last month
- This repo contains code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation"☆10Updated last week
- The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsifi…☆15Updated last month
- PyTorch implementation of StableMask (ICML'24)☆12Updated 6 months ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆13Updated 6 months ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆18Updated this week
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Updated 2 months ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆13Updated 3 months ago
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Updated 2 months ago
- This is the official repo for ByteVideoLLM/Dynamic-VLM☆18Updated last month
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆33Updated 3 months ago
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆11Updated 10 months ago
- ☆18Updated 7 months ago
- Retrieval-Augmented Personalization☆12Updated last month
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆28Updated 3 months ago
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆21Updated 3 months ago
- ☆15Updated last week
- ☆15Updated 5 months ago
- [AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…☆18Updated last week
- Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆22Updated 3 months ago
- Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning☆14Updated 2 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆40Updated 9 months ago