[AAAI 2026] SIFThinker: Spatially-Aware Image Focus for Visual Reasoning
☆23Dec 2, 2025Updated 3 months ago
Alternatives and similar repositories for SIFThinker
Users that are interested in SIFThinker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CPG-SPMT: Control-oriented Parameter-Grouped Single Particle Model with Thermal effects☆51Feb 18, 2026Updated last month
- A collection of research papers on hypervisor testing.☆58Jan 31, 2026Updated last month
- HiBug-6B: A Powerful Assisting Coding LLM |专注于辅助编程的6B模型☆13Aug 10, 2023Updated 2 years ago
- A user-friendly ROS 2 bag filter with a graphical user interface (GUI) ✨☆27May 7, 2025Updated 10 months ago
- your finance bro Agent for trading and investing☆109Nov 8, 2025Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- In-Context Reinforcement Learning for Tool Use in Large Language Models☆38Feb 4, 2026Updated last month
- ☆18Jul 31, 2025Updated 7 months ago
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆56Feb 4, 2026Updated last month
- [MICCAI 2025] GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images☆15Mar 12, 2026Updated 2 weeks ago
- [CVPR 2025] DV-Matcher: Deformation-based Non-Rigid Point Cloud Matching Guided by Pre-trained Visual Features☆29Sep 5, 2025Updated 6 months ago
- 🧬 Python code that implements the active-finite-Voronoi (AFV) model.☆20Mar 19, 2026Updated last week
- Modality Gap–Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models☆55Feb 23, 2026Updated last month
- 🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"☆26Feb 9, 2025Updated last year
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆68Jul 22, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- SurgLaVi: Official repository☆29Mar 4, 2026Updated 3 weeks ago
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆12Dec 5, 2025Updated 3 months ago
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring☆25Aug 8, 2025Updated 7 months ago
- ☆10Aug 1, 2021Updated 4 years ago
- PyTorch implementation for our CVPR 2023 paper SE-ORNet: Self-Ensembling Orientation-aware Network for Unsupervised Point Cloud Shape Cor…☆31Sep 6, 2023Updated 2 years ago
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆32Apr 20, 2025Updated 11 months ago
- Official repo for the paper "Mojito: Motion Trajectory and Intensity Control for Video Generation""☆33Jun 11, 2025Updated 9 months ago
- Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022☆11Aug 20, 2022Updated 3 years ago
- ☆43Feb 25, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 一款开源的知识管理工具-服务端☆110Oct 20, 2025Updated 5 months ago
- [MICCAI 2022] Toward Clinically Assisted Colorectal Polyp Recognition via Structured Cross-modal Representation Consistency☆12Nov 8, 2024Updated last year
- ☆77Feb 5, 2026Updated last month
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- The first to focus on HRNVS of 3DGS☆67Dec 30, 2024Updated last year
- Repository for the paper:☆69Sep 4, 2024Updated last year
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,…☆28Mar 18, 2026Updated last week
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)☆36Apr 17, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Released Code for ACL 21 paper: DocOIE A Document-level Context-Aware Dataset for OpenIE☆15Nov 25, 2022Updated 3 years ago
- [ICCV 2025] VisRL: Intention-Driven Visual Perception via Reinforced Reasoning☆46Nov 8, 2025Updated 4 months ago
- The Seguro Platform Application.☆28Jun 22, 2024Updated last year
- Official Code for "Large-scale Self-supervised Video Foundation Model for Intelligent Surgery"☆37Jun 4, 2025Updated 9 months ago
- Code and data for Distributional Correlation–Aware Knowledge Distillation for Stock Trading Volume Prediction (ECML-PKDD 22)☆14Sep 6, 2022Updated 3 years ago
- Pytorch Implementation of Our NAACL 2021 Paper "Incorporating Syntax and Semantics in Coreference Resolution with Heterogeneous Graph Att…☆10Apr 28, 2022Updated 3 years ago
- ☆11Jul 3, 2023Updated 2 years ago