ZishanShu / WaveFormerLinks
WaveFormer: Frequency-Time Decoupled Vision Modeling with Wave Equation
☆295Updated this week
Alternatives and similar repositories for WaveFormer
Users that are interested in WaveFormer are comparing it to the libraries listed below
Sorting:
- [ICLR 26] TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in f…☆843Updated 2 months ago
- [ICLR 2026]🔥🔥🔥MOSAIC: Multi-Subject Personalized Generation via Correspondence-Aware Alignment and Disentanglement☆436Updated 2 weeks ago
- 🧩 IMAGHarmony 🧩: Controllable image editing with consistent object quantity and layout. A structure-aware framework that ensures high f…☆678Updated 3 months ago
- Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion☆300Updated last month
- Official Repo of "Disentangled Reinforcement Learning for Robust Visual Quality Assessment"☆118Updated 2 weeks ago
- [AAAI 2026]🔥🔥🔥FocusDPO: Dynamic Preference Optimization for Multi-Subject Personalized Image Generation via Adaptive Focus☆381Updated 3 months ago
- [NeurIPS 2025] Native-resolution diffusion Transformer☆291Updated 3 months ago
- This is the official repository for C3-OWD: A Curriculum Cross-modal Contrastive Learning Framework for Open-World Detection☆157Updated 4 months ago
- codebase for iccv 2025 paper "One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory"☆126Updated 5 months ago
- 🎨 IMAGGarment🎨 : Fine-Grained Garment Generation with Controllable Structure, Color, and Logo. It supports precise and customizable ga…☆266Updated 3 months ago
- ☆34Updated 9 months ago
- EvoVLA: Self-Evolving Vision-Language-Action Model☆229Updated last month
- JarvisX-Cowork: Your First Personal AI Creative Assistant for Everyone!☆209Updated 2 weeks ago
- Just having comparing hybrid ResNet50+ViT models with pure ResNet18 CNN on a mixed dataset! Wanted to see how these different architectur…☆40Updated 2 months ago
- Official implementation of ''Pixel-inconsistency modeling for image manipulation localization''☆208Updated 3 months ago
- ☆82Updated 3 months ago
- Inverse Tiling of 2D Finite Domains (Siggraph Asia 2025)☆60Updated 4 months ago
- Official repository for the paper "TIIF-Bench: How Does Your T2I Model Follow Your Instructions?".☆159Updated 2 months ago
- [IEEE TASE 2025] The Official Implementation for ''Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Clo…☆108Updated last month
- The Retina Glasklart Theme☆768Updated 7 months ago
- ☆92Updated 8 months ago
- EVA OS — A real-time multimodal AIOS for next-generation hardware, enabling your devices being “alive” and as intelligent as a real brain…☆381Updated last week
- ☆806Updated 7 months ago
- ☆169Updated 3 weeks ago
- ☆167Updated 6 months ago
- ❗ This is a read-only mirror of the CRAN R package repository. DMwR — Functions and data for "Data Mining with R"☆452Updated 6 months ago
- Coherent Video Inpainting Using Optical Flow-Guided Efficient Diffusion☆301Updated 8 months ago
- shine-ray-future official website☆99Updated last week
- Laser Odometry and Mapping (continuous spin version)☆452Updated 8 months ago
- LIRA: Reasoning Reconstruction via Multimodal Large Language Models (ICCV 2025)☆319Updated last month