Jiahao000 / VICTView external linksLinks
[CVPR 2025] Test-Time Visual In-Context Tuning
☆29Dec 31, 2025Updated last month
Alternatives and similar repositories for VICT
Users that are interested in VICT are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of the paper "Equivariant Image Modeling"(https://arxiv.org/abs/2503.18948)☆34Aug 1, 2025Updated 6 months ago
- Landsat-Bench: Datasets and Benchmarks for Landsat Foundation Models☆18Jun 18, 2025Updated 7 months ago
- Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024☆14Sep 30, 2024Updated last year
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆41Feb 12, 2025Updated last year
- A benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency o…☆24Aug 7, 2025Updated 6 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆20Oct 31, 2024Updated last year
- [CVPR2025] Official code repository for SeTa: "Scale Efficient Training for Large Datasets"☆23Mar 18, 2025Updated 10 months ago
- A LLM model for space understanding☆24Sep 12, 2025Updated 5 months ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆32May 15, 2023Updated 2 years ago
- Official repository for CVPR 2025 paper: OpenSDI: Spotting Diffusion-Generated Images in the Open World☆39Jul 8, 2025Updated 7 months ago
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆54Apr 9, 2025Updated 10 months ago
- ☆35Feb 5, 2024Updated 2 years ago
- ☆91Jan 18, 2026Updated 3 weeks ago
- Codes for Arctic river segmentation using various fully convolutional neural networks.☆10Dec 27, 2022Updated 3 years ago
- [ICCV 2025] HQ-CLIP: Leveraging Large Vision-Language Models to Create High-Quality Image-Text Datasets☆62Aug 6, 2025Updated 6 months ago
- This is the official repository of the paper "SAR-TEXT: A Large-Scale SAR Image-Text Dataset Built with SAR-Narrator and Progressive Tran…☆30Oct 22, 2025Updated 3 months ago
- ☆12Apr 1, 2025Updated 10 months ago
- Scaling Properties of Diffusion Models For Perceptual Tasks (CVPR 2025)☆44May 1, 2025Updated 9 months ago
- Multi-Reward as Condition for Instruction-Based Image Editing☆58Mar 18, 2025Updated 10 months ago
- Official code for paper: F3D-Gaus: Feed-forward 3D-aware Generation on ImageNet with Cycle-Aggregative Gaussian Splatting☆50Mar 11, 2025Updated 11 months ago
- EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆27Jul 30, 2025Updated 6 months ago
- [TGRS'25] AirSpatialBot: A Spatially-Aware Aerial Agent for Fine-Grained Vehicle Attribute Recognization and Retrieval☆29Jan 6, 2026Updated last month
- ☆16Sep 1, 2025Updated 5 months ago
- ☆14Jan 5, 2022Updated 4 years ago
- [AAAI 2026]Release of code, datasets and model for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for General…☆67Dec 1, 2025Updated 2 months ago
- [MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology☆12Jun 17, 2025Updated 7 months ago
- ☆12Dec 20, 2024Updated last year
- [CVPR 2025] GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration☆20Mar 21, 2025Updated 10 months ago
- Papers about the ultra high resolution tasks.☆13Jul 12, 2024Updated last year
- ☆20Oct 15, 2025Updated 3 months ago
- [EMNLP 2022] RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees☆11Jul 15, 2023Updated 2 years ago
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation☆53Jan 22, 2025Updated last year
- Implementation of "Conditional Score Guidance for Text-Driven Image-to-Image Translation" (NeurIPS 2023).☆11Jul 19, 2023Updated 2 years ago
- ☆12Apr 18, 2025Updated 9 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆13Mar 30, 2024Updated last year
- ☆11Oct 29, 2024Updated last year
- [ICML 2025] Improving Planning of Agents for Long-Horizon Tasks☆22Oct 2, 2025Updated 4 months ago
- Deep Learning Framework with a specialisation aimed for Binarized Neural Networks.☆11Jan 9, 2022Updated 4 years ago
- Code to reproduce experiments in Markovian Flow Matching: Accelerating MCMC with Continuous Normalizing Flows☆13May 23, 2024Updated last year