Qinyu-Allen-Zhao / ArinarView external linksLinks
☆43May 30, 2025Updated 8 months ago
Alternatives and similar repositories for Arinar
Users that are interested in Arinar are comparing it to the libraries listed below
Sorting:
- [ICLR 2024 Spotlight] Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments☆20Aug 19, 2025Updated 5 months ago
- hierarchical multi-agent workflow for prompt optimazation☆14Jun 12, 2024Updated last year
- Code for ICLR 2024 paper "Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection"☆16Apr 20, 2024Updated last year
- CIFAR-10-Warehouse: Towards Broad and More Realistic Testbeds in Model Generalization Analysis☆18Jul 15, 2024Updated last year
- Code for our paper: Learning Camera Movement Control from Real-World Drone Videos☆34Apr 16, 2025Updated 10 months ago
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?☆42Nov 1, 2024Updated last year
- Taylor videos and Taylor-transformed skeletons (ICML 2024).☆16Jul 25, 2024Updated last year
- This repository includes various baseline techniques for label-free model evaluation task for the VDU2023 competition.☆19Mar 8, 2023Updated 2 years ago
- Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Fee…☆27Nov 11, 2023Updated 2 years ago
- ☆10Aug 31, 2023Updated 2 years ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 7 months ago
- Exposing Text-Image Inconsistency Using Diffusion Models (ICLR 2024)☆10Jun 15, 2024Updated last year
- [AAAI 2025] Official pytorch implementation of "Diffusion Model Patching via Mixture-of-Prompts"☆13Dec 12, 2024Updated last year
- Video Diffusion Transformers are In-Context Learners☆36Jan 6, 2025Updated last year
- ☆13Jul 10, 2024Updated last year
- [ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers☆451Dec 6, 2025Updated 2 months ago
- [ICCV 2025] The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆58Apr 5, 2025Updated 10 months ago
- Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation☆144May 27, 2025Updated 8 months ago
- PyTorch implementation of "Sample- and Parameter-Efficient Auto-Regressive Image Models" from CVPR 2025☆14Nov 21, 2025Updated 2 months ago
- Separable Diffusion Model Unlearning☆13Jan 29, 2025Updated last year
- ☆15Mar 30, 2025Updated 10 months ago
- A Chrome/Edge extension to help you quickly scan through the flood of daily ArXiv papers.☆15Mar 29, 2025Updated 10 months ago
- Automatic model evaluation (AutoEval) in CVPR'21&TPAMI'22☆37Oct 20, 2022Updated 3 years ago
- Official code for CustAny: Customizing Anything from A Single Example. Accepted by CVPR2025 (Oral)☆48Apr 10, 2025Updated 10 months ago
- [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆82Feb 7, 2026Updated last week
- PyTorch Implementation of MIMO (ICLR 2021)☆16Dec 16, 2022Updated 3 years ago
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆53Mar 16, 2025Updated 11 months ago
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization☆264Apr 7, 2025Updated 10 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆95Mar 1, 2025Updated 11 months ago
- ☆17Jun 18, 2024Updated last year
- (TPAMI'2024) ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation☆22Aug 8, 2024Updated last year
- ☆21Oct 10, 2024Updated last year
- Diffusion generation on Mesh toolbox☆23Feb 10, 2025Updated last year
- [ICCV2025] TokenBridge: Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation. https://yuqingwang1029.github.io/To…☆151Jul 24, 2025Updated 6 months ago
- ☆27Mar 3, 2025Updated 11 months ago
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation☆23Jul 30, 2025Updated 6 months ago
- [CVPR 2025] GPS as a Control Signal for Image Generation☆25Mar 18, 2025Updated 10 months ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆32Mar 26, 2025Updated 10 months ago