jiachenzhu / DyT
View external linksLinks

Code release for DynamicTanh (DyT)

☆1,033

Alternatives and similar repositories for DyT

Users that are interested in DyT are comparing it to the libraries listed below

Sorting:

LMMMEng / OverLoCK
View on GitHub
[CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
☆513Dec 25, 2025Updated last month
NVlabs / MambaVision
View on GitHub
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
☆2,023Feb 9, 2026Updated last week
LTH14 / fractalgen
View on GitHub
PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437
☆1,226Feb 25, 2025Updated 11 months ago
state-spaces / mamba
View on GitHub
Mamba SSM architecture
☆17,186Jan 12, 2026Updated last month
apple / ml-sigmoid-attention
View on GitHub
☆307Apr 23, 2025Updated 9 months ago
facebookresearch / blt
View on GitHub
Code for BLT research paper
☆2,028Nov 3, 2025Updated 3 months ago
facebookresearch / perception_models
View on GitHub
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
☆2,159Updated this week
tensorgi / TPA
View on GitHub
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)
☆446Jan 26, 2026Updated 3 weeks ago
pixeli99 / MixLN
View on GitHub
[ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…
☆29Jul 24, 2025Updated 6 months ago
facebookresearch / flow_matching
View on GitHub
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes…
☆4,134Jan 5, 2026Updated last month
facebookresearch / DiT
View on GitHub
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
☆8,352May 31, 2024Updated last year
kuleshov-group / bd3lms
View on GitHub
[ICLR 2025 Oral] Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
☆956Jul 10, 2025Updated 7 months ago
hustvl / Vim
View on GitHub
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
☆3,795Feb 13, 2025Updated last year
facebookresearch / sam2
View on GitHub
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…
☆18,503Dec 25, 2024Updated last year
sihyun-yu / REPA
View on GitHub
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,544Mar 16, 2025Updated 11 months ago
Dao-AILab / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆22,231Updated this week
KindXiaoming / pykan
View on GitHub
Kolmogorov Arnold Networks
☆16,164Jan 19, 2025Updated last year
facebookresearch / dinov2
View on GitHub
PyTorch code and models for the DINOv2 self-supervised learning method.
☆12,393Dec 22, 2025Updated last month
hustvl / LightningDiT
View on GitHub
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,393Dec 16, 2025Updated 2 months ago
FoundationVision / VAR
View on GitHub
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…
☆8,614Nov 10, 2025Updated 3 months ago
FarInHeight / To-Match-or-Not-to-Match
View on GitHub
Official code for "To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition" CVPR IMW 2025
☆38Oct 4, 2025Updated 4 months ago
Alic-Li / BlackGoose_Rimer
View on GitHub
BlackGoose Rimer: RWKV as a Superior Architecture for Large-Scale Time Series Modeling
☆30Jul 11, 2025Updated 7 months ago
fla-org / flash-linear-attention
View on GitHub
🚀 Efficient implementations of state-of-the-art linear attention models
☆4,379Updated this week
zhixuan-lin / forgetting-transformer
View on GitHub
[ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning
☆137Dec 19, 2025Updated last month
yuweihao / MambaOut
View on GitHub
MambaOut: Do We Really Need Mamba for Vision? (CVPR 2025)
☆2,654Mar 9, 2025Updated 11 months ago
tzco / Diffusion-wo-CFG
View on GitHub
Official Implementation for Diffusion Models Without Classifier-free Guidance
☆170Feb 18, 2025Updated 11 months ago
mit-han-lab / efficientvit
View on GitHub
Efficient vision foundation models for high-resolution generation and perception.
☆3,236Sep 5, 2025Updated 5 months ago
foundation-model-stack / bamba
View on GitHub
Train, tune, and infer Bamba model
☆137Jun 4, 2025Updated 8 months ago
facebookresearch / dinov3
View on GitHub
Reference PyTorch implementation and models for DINOv3
☆9,590Nov 20, 2025Updated 2 months ago
MzeroMiko / VMamba
View on GitHub
VMamba: Visual State Space Models，code is based on mamba
☆3,041Mar 7, 2025Updated 11 months ago
ML-GSAI / LLaDA
View on GitHub
Official PyTorch implementation for "Large Language Diffusion Models"
☆3,569Nov 12, 2025Updated 3 months ago
facebookresearch / vggt
View on GitHub
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
☆12,448Oct 11, 2025Updated 4 months ago
THU-MIG / yoloe
View on GitHub
YOLOE: Real-Time Seeing Anything [ICCV 2025]
☆2,037Jun 26, 2025Updated 7 months ago
MoonshotAI / MoBA
View on GitHub
MoBA: Mixture of Block Attention for Long-Context LLMs
☆2,051Apr 3, 2025Updated 10 months ago
huggingface / open-r1
View on GitHub
Fully open reproduction of DeepSeek-R1
☆25,879Nov 24, 2025Updated 2 months ago
andrehuang / loftup
View on GitHub
[ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"
☆249Jan 13, 2026Updated last month
baaivision / Emu3
View on GitHub
Next-Token Prediction is All You Need
☆2,345Jan 12, 2026Updated last month
lumalabs / imm
View on GitHub
Official implementation of Inductive Moment Matching
☆572Jul 11, 2025Updated 7 months ago
bytedance / 1d-tokenizer
View on GitHub
This repo contains the code for 1D tokenizer and generator
☆1,113Mar 20, 2025Updated 10 months ago

jiachenzhu / DyTView external linksLinks

Alternatives and similar repositories for DyT

jiachenzhu / DyT
View external linksLinks