yu-rp/apiprompting

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yu-rp/apiprompting)

yu-rp / apiprompting

[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models

☆112

Alternatives and similar repositories for apiprompting

Users that are interested in apiprompting are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yu-rp / NeuralLineage
View on GitHub
Code for CVPR 2024 Oral "Neural Lineage"
☆17Jun 18, 2024Updated 2 years ago
Yuanshi9815 / LiteFocus
View on GitHub
[Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.
☆34Mar 11, 2025Updated last year
yu-rp / Dimple
View on GitHub
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆117Jul 9, 2025Updated last year
czg1225 / VeriThinker
View on GitHub
[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient
☆67Sep 27, 2025Updated 9 months ago
Adamdad / vico
View on GitHub
Vico: Compositional Video Generation as Flow Equalization
☆59Nov 15, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
yu-rp / VisualPerceptionToken
View on GitHub
☆136Mar 22, 2025Updated last year
haiquanlu / Mix-Quant
View on GitHub
☆37May 21, 2026Updated 2 months ago
florinshen / Vista3D
View on GitHub
[ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image
☆57Sep 19, 2024Updated last year
Carol-lyh / GateControl
View on GitHub
☆22Apr 3, 2026Updated 3 months ago
VainF / In-Video-Instructions
View on GitHub
[Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control
☆45Nov 25, 2025Updated 8 months ago
xing0047 / cca-llava
View on GitHub
[NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention
☆67Aug 30, 2025Updated 10 months ago
Lexie-YU / ViFeEdit
View on GitHub
[Preprint] ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer
☆67Mar 31, 2026Updated 3 months ago
mrwu-mac / ControlMLLM
View on GitHub
[NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'
☆211Jul 17, 2025Updated last year
YiyangZhou / CSR
View on GitHub
[NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models
☆87Oct 26, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
GasolSun36 / MVP
View on GitHub
Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning
☆24Sep 9, 2024Updated last year
hefeimei06 / IDC
View on GitHub
[AAAI 2025] Efficient Image-to-Image Diffusion Classifier for Adversarial Robustness
☆20Aug 21, 2024Updated last year
YinBo0927 / RePro
View on GitHub
The official code of Refinement Provenance Inference: Detecting LLM-Refined Training Prompts from Model Behavior
☆22Jan 6, 2026Updated 6 months ago
LALBJ / PAI
View on GitHub
[ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
☆171Nov 6, 2024Updated last year
hefeimei06 / VEAttack-LVLM
View on GitHub
[ICLR 2026] VEAttack: Downstream-agnostic Vision Encoder Attack against Large Vision Language Models
☆21Feb 22, 2026Updated 5 months ago
SuhZhang / GeoSR
View on GitHub
The code for paper 'Make Geometry Matter for Spatial Reasoning'
☆53Updated this week
tsa18 / ConciseHint
View on GitHub
[Preprint arXiv: 2506.18810 ] ConciseHint: Boosting Efficient Reasoning via Continuous Concise Hints during Generation
☆26Oct 1, 2025Updated 9 months ago
Huage001 / Paint-Anything
View on GitHub
An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.
☆35Apr 16, 2023Updated 3 years ago
ShawnHuang497 / BiRD
View on GitHub
The official repository of paper named 'A Refer-and-Ground Multimodal Large Language Model for Biomedicine'
☆34Nov 5, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jiahaolu97 / anything-unsegmentable
View on GitHub
(CVPR 2024) "Unsegment Anything by Simulating Deformation"
☆29May 27, 2024Updated 2 years ago
Huage001 / URAE
View on GitHub
[ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".
☆118May 3, 2025Updated last year
florinshen / PlaneDreamer
View on GitHub
DreamGaussian with 2D-GS
☆12Oct 10, 2024Updated last year
BillChan226 / HALC
View on GitHub
[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"
☆115Dec 4, 2024Updated last year
LiQiiiii / Neural-Ligand
View on GitHub
[ICCV‘25] Official implementation of paper "Towards Performance Consistency in Multi-Level Model Collaboration"
☆45Oct 23, 2025Updated 9 months ago
jiahaolu97 / poison-splat
View on GitHub
(ICLR 2025 spotlight) "Poison-splat: Computation Cost Attack on 3D Gaussian Splatting"
☆78Feb 13, 2025Updated last year
HKUSTGZ-ML4Health-Lab / Med-Scout
View on GitHub
Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training
☆16Feb 8, 2026Updated 5 months ago
horseee / learning-to-cache
View on GitHub
[NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
☆122Jul 15, 2024Updated 2 years ago
czg1225 / CoDe
View on GitHub
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
☆108Sep 27, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Adamdad / Samesame
View on GitHub
An Tensorflow.keras implementation of Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorizatio…
☆10Dec 18, 2019Updated 6 years ago
VainF / Reasoning-SFT
View on GitHub
SFT of Reasoning LLMs with Megatron-LM
☆23Jun 19, 2025Updated last year
Adamdad / neumeta
View on GitHub
NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…
☆45Nov 8, 2024Updated last year
s-vco / s-vco
View on GitHub
Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images
☆19Jun 4, 2025Updated last year
1zhou-Wang / MemVR
View on GitHub
[ICML 2025] Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in…
☆171Sep 25, 2025Updated 10 months ago
ycjing / AmalgamateGNN.PyTorch
View on GitHub
PyTorch implementation of AmalgamateGNN (CVPR'21)
☆21Jul 29, 2022Updated 3 years ago
Adamdad / Filter-Gradient-Decent
View on GitHub
In this paper, we propose Filter Gradient Decent (FGD), an efficient stochastic optimization algorithm that makes a consistent estimation…
☆12May 18, 2021Updated 5 years ago