BOBrown / JADF-caffe
Joint Distribution Alignment via Adversarial Learning for Domain Adaptive Object Detection
☆11Updated 3 years ago
Alternatives and similar repositories for JADF-caffe
Users that are interested in JADF-caffe are comparing it to the libraries listed below
- The code of source-only training for our method☆11Updated 3 years ago
- An implementation of centerloss in multi_box_loss☆60Updated 6 years ago
- [ACL 2025 Best Theme Paper] This is the official implementation for the paper: "Meta-rater: A Multi-dimensional Data Selection Method for…☆188Updated 5 months ago
- Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models☆185Updated last year
- DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Models☆152Updated last year
- Official implementation of ECCV2022 paper End-to-End Weakly Supervised Object Detection with Sparse Proposal Evolution☆103Updated 2 years ago
- Explainable Person Re-Identification with Attribute-guided Metric Distillation☆99Updated 3 years ago
- An ultra-fast tiny model for lane detection, using onnx_parser, TensorRTAPI, and torch2trt for acceleration. Our model supports int8, dynamic…☆119Updated 4 years ago
- (NeurIPS 2024) Official PyTorch implementation of LOVA3☆90Updated 10 months ago
- [ICCV2025] II-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting☆172Updated 3 months ago
- u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model☆134Updated 10 months ago
- [ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache☆43Updated last year
- Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation☆124Updated 5 months ago
- ☆19Updated 5 years ago
- [NeurIPS 2025] Official implementation for the paper "SeePhys: Does Seeing Help Thinking? -- Benchmarking Vision-Based Physics Reasoning"☆48Updated 4 months ago
- [ACMMM 2025] Official implementation of the paper "DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompti…☆217Updated 9 months ago
- code for AAAI 2020 paper "ACT"☆88Updated 2 years ago
- AI2-THOR Data Collection Tool Based On Keyboard Interaction☆53Updated last year
- CoS: Chain-of-Shot Prompting for Long Video Understanding☆53Updated 11 months ago
- (ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator☆114Updated 10 months ago
- ☆28Updated 6 months ago
- Pruning Filter in Filter(NeurIPS2020)☆148Updated last year
- ☆147Updated 11 months ago
- Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning☆249Updated last year
- [ICML 2025] Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"☆188Updated 4 months ago
- [AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding☆301Updated last week
- (ICCV-2025 Official Code) Improving Generalist Model with Domain-Specific Experts☆87Updated 3 months ago
- ✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy☆305Updated 8 months ago
- ☆54Updated 9 months ago
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs.☆35Updated last year