Adapting Self-Supervised Representations as a Latent Space for Efficient Generation
☆40Oct 17, 2025Updated 5 months ago
Alternatives and similar repositories for RepTok
Users that are interested in RepTok are comparing it to the libraries listed below
Sorting:
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- [ICCV 2025] SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models☆45Nov 5, 2025Updated 4 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆173Mar 6, 2026Updated 2 weeks ago
- ☆175Jan 8, 2026Updated 2 months ago
- ☆98Jul 24, 2025Updated 7 months ago
- ☆31Dec 8, 2023Updated 2 years ago
- [WACV 2025] DistillDIFT: Distillation of Diffusion Features for Semantic Correspondence☆35Jul 10, 2025Updated 8 months ago
- [Arxiv'25] DINO-Tok: Adapting DINO for Visual Tokenizers☆35Nov 25, 2025Updated 3 months ago
- [ICLR 2026] PixNerd: Pixel Neural Field Diffusion☆173Dec 10, 2025Updated 3 months ago
- [AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?☆25Aug 5, 2025Updated 7 months ago
- [CVPR 2026] DDT: Decoupled Diffusion Transformer☆373Aug 22, 2025Updated 6 months ago
- ☆38Feb 6, 2025Updated last year
- Geometry style transfer colorbook☆20Jan 5, 2024Updated 2 years ago
- Frequency Autoregressive Image Generation with Continuous Tokens☆94Jun 9, 2025Updated 9 months ago
- the official code of "Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation" (ECCV2024)☆13Jan 14, 2025Updated last year
- a collection of awesome autoregressive visual generation models☆80Apr 17, 2025Updated 11 months ago
- Official Implementation of "Semantics-Consistent Feature Search for Self-Supervised Visual Representation Learning" in AAAI2024.☆13Feb 28, 2024Updated 2 years ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Aug 28, 2023Updated 2 years ago
- ☆70Dec 5, 2025Updated 3 months ago
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆88Apr 10, 2025Updated 11 months ago
- Implementation of "Conditional Score Guidance for Text-Driven Image-to-Image Translation" (NeurIPS 2023).☆11Jul 19, 2023Updated 2 years ago
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)☆18Apr 11, 2025Updated 11 months ago
- AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis☆12Oct 3, 2024Updated last year
- [NeurIPS 2021] Code for Learning Signal-Agnostic Manifolds of Neural Fields☆68Mar 3, 2023Updated 3 years ago
- [ICLR 2026] ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation☆70Feb 12, 2026Updated last month
- Contrastive self-supervised learning using Rényi divergence☆14Oct 21, 2022Updated 3 years ago
- [ICLR'24] Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition☆55May 14, 2024Updated last year
- [ICLR 2025] Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆51Apr 21, 2025Updated 10 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆35May 23, 2024Updated last year
- [NeurIPS 2025 Datasets & Benchmarks Track] The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models☆34Oct 26, 2025Updated 4 months ago
- Code for the paper "Consistency Regularization for Certified Robustness of Smoothed Classifiers" (NeurIPS 2020)☆35Jan 11, 2021Updated 5 years ago
- Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…☆16Jun 28, 2023Updated 2 years ago
- Official PyTorch Implementation of Exploring Stochastic Autoregressive Image Modeling for Visual Representation, Accepted by AAAI 2023.☆16Jul 3, 2023Updated 2 years ago
- ☆125Aug 19, 2025Updated 7 months ago
- ☆13Nov 29, 2024Updated last year
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation☆27Mar 4, 2025Updated last year
- Repository for the T2D/obesity experiments run in the Metapheno paper☆15Feb 6, 2019Updated 7 years ago
- [ICML 2025] Diff-MoE: Diffusion Transformer with Time-Aware and Space-Adaptive Experts☆29Nov 10, 2025Updated 4 months ago
- [AAAI 2026] Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing☆24Nov 20, 2025Updated 3 months ago