ReNeg: Learning Negative Embedding with Reward Guidance
☆35 · Dec 22, 2025 · Updated 2 months ago
Alternatives and similar repositories for ReNeg
Users who are interested in ReNeg are comparing it to the repositories listed below.
- ReNeg: Learning Negative Embedding with Reward Guidance ☆17 · Jan 17, 2025 · Updated last year
- This is an official PyTorch Implementation of Neighbor Relations Matter in Video Scene Detection. ☆28 · Mar 19, 2025 · Updated last year
- Official repo for "Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge" ICLR2025 ☆101 · Mar 14, 2025 · Updated last year
- CLIMB-ReID: A Hybrid CLIP-Mamba Framework for Person Re-Identification(AAAI2025) ☆47 · Nov 24, 2025 · Updated 3 months ago
- TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification (AAAI2024) ☆65 · Apr 16, 2024 · Updated last year
- ☆10 · Sep 1, 2020 · Updated 5 years ago
- Code for paper "LLMs Can Evolve Continually on Modality for X-Modal Reasoning" NeurIPS2024 ☆41 · Dec 18, 2024 · Updated last year
- ☆13 · Jul 10, 2024 · Updated last year
- MagicVFX: Visual Effects Synthesis in Just Minutes ☆16 · Dec 16, 2024 · Updated last year
- AMD 0.9B efficient text to video diffusion model ☆44 · Jan 12, 2026 · Updated 2 months ago
- Code for the paper "Comparative Analysis of CNN-based Spatiotemporal Reasoning in Videos" ☆14 · May 3, 2024 · Updated last year
- [World-Model-Survey-2024] Paper list and projects for World Model ☆15 · Oct 31, 2024 · Updated last year
- The official implementation of the TIP 2025 paper UncTrack: Reliable Visual Object Tracking with Uncertainty-Aware Prototype Memory Netwo… ☆14 · Jun 16, 2025 · Updated 9 months ago
- ☆17 · Jul 30, 2024 · Updated last year
- [TMM 2025] StableIdentity: Inserting Anybody into Anywhere at First Sight 🔥 ☆260 · Dec 26, 2024 · Updated last year
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space” ☆18 · Jan 27, 2026 · Updated last month
- Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization ☆62 · Sep 19, 2025 · Updated 6 months ago
- ECCV 2024 STMA & CVPR 2024 1st MOSE & 1st VOT Challenge & 1st LSVOS v6 ☆12 · Oct 16, 2024 · Updated last year
- EVE Series: Encoder-Free Vision-Language Models from BAAI ☆368 · Jul 24, 2025 · Updated 7 months ago
- This is the official implementation of work HiM2SAM in PRCV25. ☆25 · Aug 30, 2025 · Updated 6 months ago
- Nitro-T is a family of text-to-image diffusion models focused on highly efficient training. ☆40 · Jul 10, 2025 · Updated 8 months ago
- [ICCV2023] Isomer: Isomerous Transformer for Zero-Shot Video Object Segmentation ☆30 · Nov 21, 2023 · Updated 2 years ago
- PICABench: How Far Are We from Physically Realistic Image Editing? ☆36 · Nov 5, 2025 · Updated 4 months ago
- Code for the paper "ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions" published at CVPR 2025 ☆21 · Mar 16, 2025 · Updated last year
- [NeurIPS 2024]Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs ☆37 · Dec 3, 2024 · Updated last year
- OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework ☆12 · Feb 27, 2025 · Updated last year
- The code of the paper "Free-Lunch Color-Texture Disentanglement for Stylized Image Generation" ☆36 · Sep 18, 2025 · Updated 6 months ago
- [CVPR2024] The code of "UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory" ☆69 · Oct 15, 2024 · Updated last year
- T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25) ☆46 · Oct 6, 2025 · Updated 5 months ago
- [PR 2024] Official PyTorch Code for "Dual Teachers for Self-Knowledge Distillation" ☆13 · Nov 28, 2024 · Updated last year
- Project of Knowledge-Based Systems paper: MCT-Net: Multi-hierarchical cross transformer for hyperspectral and multispectral image fusion. ☆32 · Apr 6, 2023 · Updated 2 years ago
- [AAAI 2026] Relation-R1: Progressively Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relation Comprehension ☆18 · Mar 6, 2026 · Updated 2 weeks ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization ☆635 · Oct 29, 2025 · Updated 4 months ago
- The source code for "LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction" ☆10 · Jul 5, 2024 · Updated last year
- [AAAI2026] CADTrack: Learning Contextual Aggregation with Deformable Alignment for Robust RGBT Tracking ☆36 · Feb 12, 2026 · Updated last month
- Developer project for getting basic API integrations working in under 5 minutes ☆11 · Jan 30, 2026 · Updated last month
- Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation ☆23 · Jul 30, 2025 · Updated 7 months ago
- HS-Diffusion: Semantic-Mixing Diffusion for Head Swapping ☆90 · May 10, 2024 · Updated last year
- A collection of HRNet applications (Please feel free to add your applications if not included) ☆27 · Jul 3, 2020 · Updated 5 years ago