yu-rp/VisualPerceptionToken

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yu-rp/VisualPerceptionToken)

yu-rp / VisualPerceptionToken

☆136

Alternatives and similar repositories for VisualPerceptionToken

Users that are interested in VisualPerceptionToken are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yu-rp / NeuralLineage
View on GitHub
Code for CVPR 2024 Oral "Neural Lineage"
☆17Jun 18, 2024Updated 2 years ago
zhishuifeiqian / VCR-Bench
View on GitHub
VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning
☆37May 9, 2026Updated 2 months ago
czg1225 / VeriThinker
View on GitHub
[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient
☆67Sep 27, 2025Updated 9 months ago
jiahaolu97 / anything-unsegmentable
View on GitHub
(CVPR 2024) "Unsegment Anything by Simulating Deformation"
☆29May 27, 2024Updated 2 years ago
Carol-lyh / GateControl
View on GitHub
☆22Apr 3, 2026Updated 3 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
florinshen / PlaneDreamer
View on GitHub
DreamGaussian with 2D-GS
☆12Oct 10, 2024Updated last year
Adamdad / vico
View on GitHub
Vico: Compositional Video Generation as Flow Equalization
☆59Nov 15, 2024Updated last year
yu-rp / apiprompting
View on GitHub
[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models
☆112Oct 10, 2024Updated last year
Yuanshi9815 / LiteFocus
View on GitHub
[Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.
☆34Mar 11, 2025Updated last year
florinshen / Vista3D
View on GitHub
[ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image
☆57Sep 19, 2024Updated last year
MaybeLizzy / PERMU
View on GitHub
☆34Oct 4, 2025Updated 9 months ago
yu-rp / Dimple
View on GitHub
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆117Jul 9, 2025Updated last year
Haochen-Wang409 / TreeVGR
View on GitHub
[ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology
☆92Jan 26, 2026Updated 5 months ago
Lexie-YU / ViFeEdit
View on GitHub
[Preprint] ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer
☆67Mar 31, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
VainF / Reasoning-SFT
View on GitHub
SFT of Reasoning LLMs with Megatron-LM
☆23Jun 19, 2025Updated last year
haiquanlu / Mix-Quant
View on GitHub
☆36May 21, 2026Updated 2 months ago
saccharomycetes / mllms_know
View on GitHub
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
☆380Apr 20, 2025Updated last year
YujiaHu1109 / IEAP
View on GitHub
[NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models
☆118Sep 27, 2025Updated 9 months ago
fscdc / Awesome-Efficient-Reasoning-Models
View on GitHub
[TMLR 2025] Efficient Reasoning Models: A Survey
☆314Jun 26, 2026Updated 3 weeks ago
Huage001 / URAE
View on GitHub
[ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".
☆118May 3, 2025Updated last year
Huage001 / StyDeSty
View on GitHub
PyTorch implementation of paper "StyDeSty: Min-Max Stylization and Destylization for Single Domain Generalization" in ICML 2024.
☆16Jun 4, 2024Updated 2 years ago
jiahaolu97 / poison-splat
View on GitHub
(ICLR 2025 spotlight) "Poison-splat: Computation Cost Attack on 3D Gaussian Splatting"
☆78Feb 13, 2025Updated last year
VainF / Isomorphic-Pruning
View on GitHub
[ECCV 2024] Isomorphic Pruning for Vision Models
☆89Jul 23, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
VainF / Thinkless
View on GitHub
[NeurIPS 2025] Thinkless: LLM Learns When to Think
☆261Sep 26, 2025Updated 9 months ago
jungao1106 / ICoT
View on GitHub
[CVPR' 25] Interleaved-Modal Chain-of-Thought
☆112Dec 30, 2025Updated 6 months ago
SalesforceAIResearch / LATTE
View on GitHub
☆70Jun 2, 2026Updated last month
SuhZhang / GeoSR
View on GitHub
The code for paper 'Make Geometry Matter for Spatial Reasoning'
☆53Jul 1, 2026Updated 2 weeks ago
LiQiiiii / Neural-Ligand
View on GitHub
[ICCV‘25] Official implementation of paper "Towards Performance Consistency in Multi-Level Model Collaboration"
☆45Oct 23, 2025Updated 8 months ago
shiqichen17 / AdaptVis
View on GitHub
Github repository for "Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas" (ICML 2025)
☆76May 2, 2025Updated last year
fscdc / ReasonMap
View on GitHub
[CVPR 2026] ReasonMap: Towards Fine-Grained Visual Reasoning from Transit Maps
☆86Feb 22, 2026Updated 4 months ago
czg1225 / CoDe
View on GitHub
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
☆108Sep 27, 2025Updated 9 months ago
TIGER-AI-Lab / Pixel-Reasoner
View on GitHub
Pixel-Level Reasoning Model trained with RL [NeuIPS25]
☆300Jun 4, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tsa18 / ConciseHint
View on GitHub
[Preprint arXiv: 2506.18810 ] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation
☆26Oct 1, 2025Updated 9 months ago
AFeng-x / Draw-and-Understand
View on GitHub
[ICLR2025] Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
☆94Dec 1, 2025Updated 7 months ago
VainF / TinyFusion
View on GitHub
[CVPR 2025 Highlight] TinyFusion: Diffusion Transformers Learned Shallow
☆170Dec 1, 2025Updated 7 months ago
nopnor / SCOPE
View on GitHub
☆32May 11, 2026Updated 2 months ago
YinBo0927 / RePro
View on GitHub
The official code of Refinement Provenance Inference: Detecting LLM-Refined Training Prompts from Model Behavior
☆22Jan 6, 2026Updated 6 months ago
JngwenYe / LIRF
View on GitHub
Code for ECCV 2022 paper “Learning with Recoverable Forgetting”
☆21Jul 27, 2022Updated 3 years ago
yu-rp / Distribution-Shift-Iverson
View on GitHub
☆42Sep 5, 2023Updated 2 years ago