[CVPR 2025 Highlight] Interpreting Object-level Foundation Models via Visual Precision Search
☆54Nov 24, 2025Updated 3 months ago
Alternatives and similar repositories for VPS
Users that are interested in VPS are comparing it to the libraries listed below
Sorting:
- [ACM MM21] Official Code: Identity-Preserving Face Anonymization via Adaptively Facial Attributes Obfuscation☆18Jun 5, 2024Updated last year
- [NLPCC'23] ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles PyTorch Implementation☆14Oct 7, 2023Updated 2 years ago
- Official implement of our work: Online Estimating Weight of White Pekin Duck Carcass by Computer Vision☆35Dec 15, 2022Updated 3 years ago
- [KBS] PCAE: A Framework of Plug-in Conditional Auto-Encoder for Controllable Text Generation PyTorch Implementation☆26Apr 10, 2023Updated 2 years ago
- [TPAMI 2025] Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection☆53Jul 1, 2025Updated 8 months ago
- [TNNLS, to appear] FET-LM: Flow Enhanced Variational Auto-Encoder for Topic-Guided Language Modeling PyTorch Implementation☆14Mar 4, 2023Updated 3 years ago
- [Preprint] AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling PyTorch Implementation☆37Oct 18, 2023Updated 2 years ago
- [Tool] AutoRec (2015) PyTorch Implementation☆10Mar 1, 2020Updated 6 years ago
- [Paperlist] Awesome paper list of controllable text generation via latent auto-encoders. Contributions of any kind are welcome.☆52Dec 23, 2022Updated 3 years ago
- [ISSTA 2025] Unlocking Low Frequency Syscalls in Kernel Fuzzing with Dependency-Based RAG☆52Jan 29, 2026Updated last month
- NEUQ测控专业的考试资料☆21Sep 21, 2022Updated 3 years ago
- [Paperlist] Awesome paper list of multimodal dialog, including methods, datasets and metrics☆37Jan 22, 2025Updated last year
- 2020年秋国科大模式识别(刘成林、向世明、张煦尧)课后作业☆10Feb 3, 2021Updated 5 years ago
- visual point clouds (with bbox) by Plotly☆15Nov 10, 2021Updated 4 years ago
- 2021-2022国科大强化学习格斗游戏大作业☆37Jun 11, 2022Updated 3 years ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Jan 17, 2026Updated last month
- The codebase for ABAW4 challenge of ECCV2022 workshop.☆21Jun 18, 2023Updated 2 years ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Sep 15, 2023Updated 2 years ago
- rmp data ranking☆13Nov 4, 2025Updated 4 months ago
- Official PyTorch implementation for paper "ProAPO: Progressively Automatic Prompt Optimization for Visual Classification". The paper is a…☆28Nov 9, 2025Updated 3 months ago
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆40Apr 18, 2025Updated 10 months ago
- An experiment to see if we can process G2 reviews to extract topics from reviews☆10Feb 5, 2024Updated 2 years ago
- A music composer and player with MATLAB☆11Mar 14, 2020Updated 5 years ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆53Jul 23, 2025Updated 7 months ago
- Tools for generating single-cell gene expression data☆34Jun 20, 2025Updated 8 months ago
- [ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"☆86Nov 28, 2023Updated 2 years ago
- High Security Surveillance Camera using OpenCV, Python & Arduino☆12Jun 20, 2020Updated 5 years ago
- [ISBI 2024] Official implementation of GLOBAL-LOCAL (FREQUENCY) FILTER NETWORKS FOR EFFICIENT MEDICAL IMAGE SEGMENTATION☆14May 28, 2024Updated last year
- This is the repository for the source code of the paper "Structure-Aware Single-Source Generalization with Pixel-Level Disentanglement fo…☆19Dec 22, 2024Updated last year
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆24Nov 29, 2025Updated 3 months ago
- Code for the paper "CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification", Comput…☆17Nov 24, 2025Updated 3 months ago
- SpringBoot和VUE的前后端分离开发入门项目---车辆管理系统前端☆12Dec 12, 2022Updated 3 years ago
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆16Mar 23, 2025Updated 11 months ago
- ☆11May 16, 2025Updated 9 months ago
- Bird's Eye View Calibration Toolkit☆17Jun 21, 2025Updated 8 months ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 26, 2026Updated last week
- ☆13Jun 9, 2025Updated 8 months ago
- SAM2PATH paper code repo☆44Feb 25, 2025Updated last year
- Code of Feature Fusion Transferability Aware Transformer for Unsupervised Domain Adaptation, WACV 2025☆10Dec 5, 2024Updated last year