Adapting Self-Supervised Representations as a Latent Space for Efficient Generation
☆40Oct 17, 2025Updated 4 months ago
Alternatives and similar repositories for RepTok
Users that are interested in RepTok are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆164Jan 7, 2026Updated last month
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- ☆95Jul 24, 2025Updated 7 months ago
- RLHF for Video Diffusion Models☆23Jul 30, 2025Updated 6 months ago
- ☆171Jan 8, 2026Updated last month
- [Arxiv'25] DINO-Tok: Adapting DINO for Visual Tokenizers☆35Nov 25, 2025Updated 3 months ago
- [ICLR 2026] PixNerd: Pixel Neural Field Diffusion☆170Dec 10, 2025Updated 2 months ago
- Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT model☆13Sep 25, 2024Updated last year
- ☆31Dec 8, 2023Updated 2 years ago
- the official code of "Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation" (ECCV2024)☆13Jan 14, 2025Updated last year
- Frequency Autoregressive Image Generation with Continuous Tokens☆94Jun 9, 2025Updated 8 months ago
- [CVPR 2026] DDT: Decoupled Diffusion Transformer☆363Aug 22, 2025Updated 6 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆35May 23, 2024Updated last year
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆32Jan 19, 2024Updated 2 years ago
- ☆38Feb 6, 2025Updated last year
- Code for the paper "Consistency Regularization for Certified Robustness of Smoothed Classifiers" (NeurIPS 2020)☆35Jan 11, 2021Updated 5 years ago
- Implementation of the paper "MaskBit: Embedding-free Image Generation from Bit Tokens"☆88Apr 10, 2025Updated 10 months ago
- ComfyUI workflows to create smooth transitions between video clips using Wan VACE. Works with video from any model or other source-LTX-2,…☆30Feb 10, 2026Updated 2 weeks ago
- Scripts for the Lustre File System and Robinhood Policy Engine☆10Aug 31, 2023Updated 2 years ago
- ☆10Sep 17, 2022Updated 3 years ago
- ☆10Sep 4, 2021Updated 4 years ago
- Amazon S3 tokenizer☆10Feb 19, 2026Updated last week
- Where is the "main theme" in an orchestral score?☆12Oct 25, 2025Updated 4 months ago
- [ICLR'24] Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition☆55May 14, 2024Updated last year
- ☆15Mar 11, 2025Updated 11 months ago
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders☆208Feb 13, 2026Updated 2 weeks ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆183Mar 20, 2025Updated 11 months ago
- Joint Embedding Predictive Architecture for Musical Stem Compatibility Estimation☆48Aug 6, 2024Updated last year
- PyTorch Implementation of "ZstGAN: An Adversarial Approach for Unsupervised Zero-Shot Image-to-Image Translation"☆42Jun 1, 2019Updated 6 years ago
- Codes for "Learning bounds for risk-sensitive learning," NeurIPS 2020 (or see arXiv 2006.08138)☆11Oct 15, 2020Updated 5 years ago
- ☆37Oct 29, 2025Updated 3 months ago
- GPU accelerated Perlin Noise in python☆11Oct 23, 2020Updated 5 years ago
- Today I Learnd☆10Mar 30, 2021Updated 4 years ago
- PAM module for Auth0☆12Apr 20, 2020Updated 5 years ago
- Contrastive self-supervised learning using Rényi divergence☆14Oct 21, 2022Updated 3 years ago
- An agentic runtime that enables secure, extensible and configurable AI automation from any model☆17Updated this week
- Image Tokenizer Needs Post-Training☆24Oct 4, 2025Updated 4 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- ☆17Jun 24, 2025Updated 8 months ago