NK-JittorCV / nk-diffusion
☆16Updated 8 months ago
Alternatives and similar repositories for nk-diffusion:
Users that are interested in nk-diffusion are comparing it to the libraries listed below
- An open source codebase for object detection based on Jittor☆18Updated 2 months ago
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆19Updated 2 weeks ago
- Enhancing Representations through Heterogeneous Self-Supervised Learning (TPAMI 2025)☆12Updated last week
- ☆16Updated 4 months ago
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆27Updated last month
- Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation☆50Updated 8 months ago
- [ECCV 2024] Early Preparation Pays Off: New Classifier Pre-tuning for Class Incremental Semantic Segmentation☆30Updated 2 months ago
- Official code for K-LoRA (CVPR 2025)☆102Updated 2 months ago
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆70Updated 7 months ago
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples☆51Updated 6 months ago
- Video Reasoning Segmentation☆20Updated 5 months ago
- [CVPR 2024] Official implementation of "Universal Segmentation at Arbitrary Granularity with Language Instruction"☆86Updated last year
- Official Code for 'Referring Camouflaged Object Detection (指向性伪装物体检测) ' (TPAMI 2025)☆96Updated 3 months ago
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆116Updated 5 months ago
- [ECCV 2024] Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration☆73Updated 2 months ago
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆82Updated 3 weeks ago
- An official code for "A Decoupled Spatio-Temporal Framework for Skeleton-based Action Segmentation".☆33Updated last year
- Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'☆56Updated 4 months ago
- Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation☆108Updated 3 weeks ago
- (CVPR 2025 Highlight) Official repository of paper "AODRaw: Towards RAW Object Detection in Diverse Conditions" (https://arxiv.org/pdf/24…☆11Updated last month
- Initial code for computer vision experiments☆11Updated 2 years ago
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆85Updated 3 months ago
- Exploring Feature Self-relation for Self-supervised Transformer (TPAMI 2023)☆21Updated last week
- The repository contains the official implementation of "Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation"☆40Updated 2 months ago
- ☆53Updated 7 months ago
- ☆11Updated 4 months ago
- A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of…☆63Updated 2 weeks ago
- [CVPR 2025] Mr. DETR: Instructive Multi-Route Training for Detection Transformers☆75Updated 3 weeks ago
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"☆197Updated 2 weeks ago
- [CVPR 2025] Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution☆102Updated last month