StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation
☆43Jun 6, 2025Updated 8 months ago
Alternatives and similar repositories for StyleAR
Users who are interested in StyleAR are comparing it to the repositories listed below
- ☆19Aug 19, 2024Updated last year
- [IJCAI 2022 poster] PyTorch Implementation of "Universal Video Style Transfer via Crystallization, Separation, and Blending"☆17Mar 10, 2023Updated 2 years ago
- DiP: Taming Diffusion Models in Pixel Space☆55Nov 27, 2025Updated 3 months ago
- Generate image at any resolution.☆42Sep 16, 2025Updated 5 months ago
- OmniStyle: Filtering High Quality Style Transfer Data at Scale (CVPR 2025)☆34Aug 9, 2025Updated 6 months ago
- This is the official Tensorflow implementation of our paper: "DualAST: Dual Style-Learning Networks for Artistic Style Transfer"☆25Nov 10, 2021Updated 4 years ago
- PyTorch Code for "All-to-key Attention for Arbitrary Style Transfer" (Accepted by ICCV 2023)☆24Aug 8, 2023Updated 2 years ago
- Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning☆45Jul 2, 2025Updated 8 months ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- [ACL 2025] The official PyTorch implementation of "MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection".☆26May 26, 2025Updated 9 months ago
- This is a helper extension for ComfyUI that assists with node connections.☆46Apr 7, 2025Updated 10 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆86Feb 27, 2025Updated last year
- Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’☆21Dec 19, 2025Updated 2 months ago
- An STFT/iSTFT written in PyTorch using 1D Convolutions☆32Jul 9, 2024Updated last year
- ☆35Nov 5, 2024Updated last year
- ComfyUI-AutoSplitGridImage: A custom node for ComfyUI that intelligently splits images into grids, combining edge detection for columns a…☆43Jan 6, 2025Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- ☆16Jun 12, 2025Updated 8 months ago
- ☆11Aug 11, 2023Updated 2 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- A small set of unique adapters meant to bridge the dual_stream_shunt trained for guiding prompt embeddings and diffusion.☆14Nov 26, 2025Updated 3 months ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- TensorFlow code for our ICCV 2019 paper "Multimodal Style Transfer via Graph Cuts"☆41Dec 20, 2019Updated 6 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- Using image caption models to extract prompts in ComfyUI☆10May 21, 2025Updated 9 months ago
- AI Prompt Factory - Automated High-Quality Prompt Suite Generation After the system runs, various roles required by agents will be creat…☆17Dec 20, 2025Updated 2 months ago
- RIFE with IFUNet, FusionNet and RefineNet☆12Jun 30, 2022Updated 3 years ago
- Make HiDream-I1 available in ComfyUI.☆10Apr 14, 2025Updated 10 months ago
- A VapourSynth filter that displays the FFT frequency spectrum of a given clip.☆12Dec 12, 2021Updated 4 years ago
- AI Deinterlacing functions for Vapoursynth☆17Nov 4, 2025Updated 4 months ago
- The official code of "Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling"☆47Updated this week
- ☆11Nov 29, 2024Updated last year
- Waifu age detector!☆15Feb 22, 2025Updated last year
- ComfyUI nodes to use DiLightNet☆11Oct 6, 2024Updated last year
- ☆11Sep 26, 2024Updated last year
- BEGANSing - Korean SVS + SVC + AudioSR☆11Feb 17, 2024Updated 2 years ago
- singing voice conversion based on glow-tts☆12Aug 20, 2023Updated 2 years ago