tinglyfeng / figure_for_data_analysis
☆10Updated 2 years ago
Alternatives and similar repositories for figure_for_data_analysis
Users who are interested in figure_for_data_analysis are comparing it to the libraries listed below.
- The official implementation of the paper "DIP: Dual Incongruity Perceiving Network for Sarcasm Detection"☆34Updated 8 months ago
- ☆82Updated 10 months ago
- ☆71Updated 4 months ago
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆58Updated last month
- Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆62Updated 3 months ago
- [CVPR 2024] This is the official implementation of "MART: Masked Affective RepresenTation Learning via Masked Temporal Distribution Disti…☆18Updated 2 months ago
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆68Updated last month
- [ICML 2025 Spotlight] MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding☆54Updated last month
- [NeurIPS2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models"☆192Updated last month
- Some study notes on the official LLaVA code☆30Updated 10 months ago
- Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]☆55Updated 3 months ago
- [ICCV 2025] Official PyTorch Code for "Advancing Textual Prompt Learning with Anchored Attributes"☆89Updated last week
- [CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding☆98Updated last week
- [ICLR 2025] TRACE: Temporal Grounding Video LLM via Causal Event Modeling☆117Updated 2 weeks ago
- The official implementation of RAR☆91Updated last year
- [CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga☆117Updated last week
- Multimodal-Composite-Editing-and-Retrieval-update☆33Updated 10 months ago
- Official Implementation for MoPE: Parameter-Efficient and Scalable Multimodal Fusion via Mixture of Prompt☆22Updated last month
- [ECCV 2024] Official repository of ECCV 2024 paper: Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion M…☆15Updated 3 months ago
- ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆17Updated 2 months ago
- This is the official implementation of 2025 CVPR paper "EmoEdit: Evoking Emotions through Image Manipulation".☆27Updated 5 months ago
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.☆42Updated 10 months ago
- [SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval☆42Updated last year
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models☆17Updated last year
- [CVPR 2025] LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant☆142Updated last month
- [ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'☆253Updated 4 months ago
- TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning☆101Updated 3 months ago
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆11Updated 4 months ago
- [CVPR 2025 Highlight] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆27Updated this week
- A comprehensive survey of Composed Multi-modal Retrieval (CMR), including Composed Image Retrieval (CIR) and Composed Video Retrieval (CV…☆51Updated 2 weeks ago