HelmholtzAI-FZJ / flex_genLinks

☆17

Alternatives and similar repositories for flex_gen

Users that are interested in flex_gen are comparing it to the libraries listed below

Sorting:

locuslab / llava-token-compression
☆42Updated 8 months ago
TIGER-AI-Lab / VISTA
The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]
☆18Updated 4 months ago
philippe-eecs / small-vision
A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.
☆34Updated last year
g-luo / vlm_cross_modal_reps
Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025
☆27Updated 2 months ago
MarkXCloud / CSpD
The official repo of continuous speculative decoding
☆27Updated 3 months ago
Optimization-AI / FastCLIP
Distributed Optimization Infra for learning CLIP models
☆26Updated 9 months ago
zhixuan-lin / forgetting-transformer
[ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"
☆116Updated last week
CompVis / DisCLIP
[AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?
☆17Updated 6 months ago
pixeli99 / MixLN
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆25Updated 6 months ago
TencentARC / GRPO-CARE
☆64Updated 3 weeks ago
caojiaolong / Awesome-Mamba
Collect papers about Mamba (a selective state space model).
☆14Updated 11 months ago
fla-org / flash-bidirectional-linear-attention
Triton implement of bi-directional (non-causal) linear attention
☆52Updated 5 months ago
philippe-eecs / vitok
☆32Updated 2 months ago
MikaStars39 / StableMask
PyTorch implementation of StableMask (ICML'24)
☆13Updated last year
Gen-Verse / HermesFlow
HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation
☆63Updated 5 months ago
MengLcool / DeepStack-VL
[NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…
☆37Updated last year
LINs-lab / GMem
[Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models
☆38Updated 4 months ago
theAdamColton / vq-clip
Train vector quantized CLIP models using pytorch lightning
☆20Updated last year
ggjy / vision_weak_to_strong
☆38Updated last year
kuleshov-group / remdm
Remasking Discrete Diffusion Models with Inference-Time Scaling
☆34Updated 4 months ago
lzw-lzw / UnifiedMLLM
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
☆22Updated 11 months ago
yikangshen / MoA
Mixture of Attention Heads
☆47Updated 2 years ago
chuanyang-Zheng / DAPE
The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"
☆38Updated 9 months ago
chenllliang / DnD-Transformer
[ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…
☆76Updated 7 months ago
RenShuhuai-Andy / NBP
Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling
☆37Updated 5 months ago
ziplab / SN-Netv2
[ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".
☆27Updated last year
yuecao0119 / MMInstruct
[SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di…
☆55Updated 8 months ago
ApexGen-X / MergeVQ
[CVPR] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization
☆36Updated 3 weeks ago
alhojel / visual_task_vectors
☆38Updated 11 months ago
jeykigung / HiCLIP
☆29Updated 2 years ago