BryceZhuo / PolyComLinks
The official implementation of ICLR 2025 paper "Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models".
☆14Updated 2 months ago
Alternatives and similar repositories for PolyCom
Users that are interested in PolyCom are comparing it to the libraries listed below
Sorting:
- ☆42Updated 7 months ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"☆20Updated 8 months ago
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆22Updated 10 months ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆18Updated 8 months ago
- ☆43Updated 5 months ago
- [CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention☆22Updated 3 months ago
- ☆18Updated last month
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated 3 months ago
- [CVPR] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization☆31Updated this week
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆18Updated 4 months ago
- RewardAnything: Generalizable Principle-Following Reward Models☆22Updated 2 weeks ago
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆15Updated 2 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆24Updated 6 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 10 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆27Updated last month
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models☆19Updated 4 months ago
- ☆47Updated last year
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆23Updated 4 months ago
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆31Updated 2 months ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆20Updated 2 months ago
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025)☆28Updated 2 months ago
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆16Updated 7 months ago
- [ICCV 2025] Dynamic-VLM☆20Updated 6 months ago
- ☆14Updated last month
- CLIP-MoE: Mixture of Experts for CLIP☆42Updated 8 months ago
- LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models☆18Updated 2 months ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆15Updated 11 months ago
- ☆16Updated 2 months ago
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25Updated this week
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆19Updated 2 months ago