zhangquanchen/SIFThinker

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zhangquanchen/SIFThinker)

zhangquanchen / SIFThinker

[AAAI 2026] SIFThinker: Spatially-Aware Image Focus for Visual Reasoning

☆22

Alternatives and similar repositories for SIFThinker

Users that are interested in SIFThinker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhangquanchen / 3DThinker
View on GitHub
[CVPR 2026] Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views
☆244May 7, 2026Updated 2 months ago
zhangquanchen / 4DThinker
View on GitHub
4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding
☆78May 26, 2026Updated 2 months ago
HiBugEnterprise / HiBug-6B
View on GitHub
HiBug-6B: A Powerful Assisting Coding LLM ｜专注于辅助编程的6B模型
☆14Aug 10, 2023Updated 2 years ago
sinberCS / switch2ai
View on GitHub
switch2ai - A JetBrains IDE plugin enabling seamless collaboration between JetBrains IDEs and various AI agents (Cursor, Qoder, Claude co…
☆173Nov 11, 2025Updated 8 months ago
MarkLee131 / Hypervisor-Testing-Survey
View on GitHub
A collection of research papers on hypervisor testing.
☆65May 21, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
starshine-f / Agent-Debate
View on GitHub
A multi-agent debate framework supporting AI-vs-AI and Human-vs-AI modes with customizable models, personas, and role-specific prompts.
☆67Dec 4, 2025Updated 7 months ago
fscdc / Oracle-Pruning-Sanity-Check
View on GitHub
[TMLR 2026] Is Oracle Pruning the True Oracle?
☆35Jul 1, 2026Updated 3 weeks ago
JingyuanXu / ucfaceconbainall
View on GitHub
Unified Semantic Curation Face (USCFace): An RDF Curation & Visualization System
☆38Jul 18, 2025Updated last year
ECNU-SII / Continual-NExT
View on GitHub
☆235Jun 27, 2026Updated last month
DEFENSE-SEU / RobustFlow
View on GitHub
Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"
☆238Oct 19, 2025Updated 9 months ago
ant-research / AvatarArtist
View on GitHub
[CVPR'25] Official PyTorch implementation of AvatarArtist: Open-Domain 4D Avatarization.
☆280Jun 14, 2025Updated last year
zhangyulin-space / ChatFerry
View on GitHub
☆104Oct 8, 2025Updated 9 months ago
Victor20082018 / -Optimized-Aquatic-Target-Recognition-Model
View on GitHub
The enhanced model is specially trained for aquatic targets, achieving higher accuracy. It can detect sailboats, humans, other vessels, b…
☆48May 15, 2025Updated last year
FrankSuperG / CPG-SPMT
View on GitHub
CPG-SPMT: Control-oriented Parameter-Grouped Single Particle Model with Thermal effects
☆80Apr 22, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
liufanfanlff / C3-Context-Cascade-Compression
View on GitHub
Official code implementation of Context Cascade Compression: Exploring the Upper Limits of Text Compression
☆313Jan 27, 2026Updated 6 months ago
yunbeizhang / Awesome-Visual-Prompt-Tuning
View on GitHub
[TMLR] A curated list of awesome papers, resources, and tools for Visual Prompt Tuning (VPT).
☆115Feb 22, 2026Updated 5 months ago
AIR-DISCOVER / FreeAskWorld
View on GitHub
[AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level plannin…
☆229Jul 3, 2026Updated 3 weeks ago
rqhuang88 / DV-Matcher
View on GitHub
[CVPR 2025] DV-Matcher: Deformation-based Non-Rigid Point Cloud Matching Guided by Pre-trained Visual Features
☆29Sep 5, 2025Updated 10 months ago
RLHFlow / Reinforce-Ada
View on GitHub
[COLM 2026] An adaptive sampling framework for Reinforce-style LLM post training.
☆97Nov 29, 2025Updated 8 months ago
xlyu0106 / MACT
View on GitHub
☆19Jul 31, 2025Updated 11 months ago
gulucaptain / DynamiCtrl
View on GitHub
[TMM'26] Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.
☆142May 23, 2025Updated last year
GaotangLi / Beyond-Log-Likelihood
View on GitHub
[ICML'26 Spotlight] What is the right loss function for LLM supervised finetuning?
☆66May 28, 2026Updated 2 months ago
EDAPINENUT / ExplicitShortCut
View on GitHub
Official implementation of the paper <On the Design of One-Step Diffusion via Shortcutting Flow Paths>
☆287Apr 1, 2026Updated 3 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tangpan360 / MicroRCA-Agent
View on GitHub
2025 CCF International AIOps Challenge | Track 1: Microservice Root Cause Localization Based on Large Model Agents | "男团910" Solution · T…
☆259Jan 14, 2026Updated 6 months ago
zhangquanchen / VisRL
View on GitHub
[ICCV 2025] VisRL: Intention-Driven Visual Perception via Reinforced Reasoning
☆47Nov 8, 2025Updated 8 months ago
gwh22 / UniVoice
View on GitHub
UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models
☆115Oct 30, 2025Updated 8 months ago
LijunRio / Awesome-Nano-Banana-for-Medical-Imaging
View on GitHub
Exploring Gemini-2.5-Flash-Image in medical imaging—segmentation, simulation, and cross-modal understanding with synthetic examples.
☆37Dec 14, 2025Updated 7 months ago
jiaweizzhao / InRank
View on GitHub
☆153Jan 2, 2024Updated 2 years ago
Ga-Lee / Frequency-aware-Length-EXtension
View on GitHub
official implementation for paper titled "Training-free Horizon Extension for Autoregressive Video Generation"
☆117Feb 17, 2026Updated 5 months ago
serendipity800 / open-motion-apis
View on GitHub
☆80Mar 5, 2026Updated 4 months ago
zfr00 / ESCNet
View on GitHub
the official code for ESCNet: Entity-enhanced and Stance Checking Network for Multi-modal Fact-Checking
☆42Jan 14, 2025Updated last year
FastMAS / KVCOMM
View on GitHub
[NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems
☆182Nov 3, 2025Updated 8 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
guanhaisu / OBSD
View on GitHub
Deciphering Oracle Bone Language with Diffusion Models (ACL 2024 Best Paper)
☆232Sep 17, 2025Updated 10 months ago
Jinghaoleven / RLFR
View on GitHub
Official implementation of RLFR: Extending Reinforcement Learning for LLMs with Flow Environment
☆48Nov 15, 2025Updated 8 months ago
kand-ta / kand
View on GitHub
Kand: Blazing-Fast, Modern Technical Analysis in Rust, Python, and WASM.
☆564Jan 22, 2026Updated 6 months ago
HKUDS / LightReasoner
View on GitHub
[ACL 2026 Oral] "LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"
☆604May 22, 2026Updated 2 months ago
THUDM / INFTY
View on GitHub
INFTY Engine: An Optimization Toolkit to Support Continual AI
☆573Jun 8, 2026Updated last month
Haochen-Wang409 / ross3d
View on GitHub
[ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
☆70Jul 22, 2025Updated last year
fengzeAltos / ROS2-Bag-Filter
View on GitHub
A user-friendly ROS 2 bag filter with a graphical user interface (GUI) ✨
☆27May 7, 2025Updated last year