bytedance/ParGo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bytedance/ParGo)

bytedance / ParGo

Official PyTorch Implementation of ParGo: Bridging Vision-Language with Partial and Global Views. (AAAI 2025)

☆16

Alternatives and similar repositories for ParGo

Users that are interested in ParGo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YYJMJC / LOUPE
View on GitHub
☆45Aug 14, 2023Updated 2 years ago
lerogo / aaai24_itr_cusa
View on GitHub
Source code of our AAAI 2024 paper "Cross-Modal and Uni-Modal Soft-Label Alignment for Image-Text Retrieval"
☆55Mar 28, 2024Updated 2 years ago
Sutadasuto / syncrack_generator
View on GitHub
Code for generating synthetic pavement images with cracks (as well as the ground truth annotation of such cracks).
☆13Apr 20, 2022Updated 4 years ago
snskysk / CAM-Back-Again
View on GitHub
This is the project page for paper `CAM Back Again: Large Kernel CNNs from a Weakly Supervised Object Localization Perspective`, in CVPR2…
☆13Mar 19, 2024Updated 2 years ago
FireRedTeam / IVC-Prune
View on GitHub
IVC-Prune: Revealing the Implicit Visual Coordinates in LVLMs for Vision Token Pruning
☆16Feb 27, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
alipay / PC2-NoiseofWeb
View on GitHub
Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …
☆16Nov 20, 2025Updated 8 months ago
96-Zachary / vse_2ad
View on GitHub
☆15Apr 30, 2022Updated 4 years ago
CrossmodalGroup / ESL
View on GitHub
☆12May 3, 2024Updated 2 years ago
mayug / 0-shot-llm-vision
View on GitHub
This repository contains the code for our CVPR 2024 paper,
☆16Aug 27, 2024Updated last year
cvsp-lab / AgilePruner
View on GitHub
[ICLR 2026] AgilePruner: An Empirical Study of Attention and Diversity for Adaptive Visual Token Pruning in Large Vision-Language Models
☆28Mar 3, 2026Updated 4 months ago
sooonwoo / CL-Baselines
View on GitHub
This is a Pytorch implementation of contrastive Learning(CL) baselines.
☆14Aug 29, 2022Updated 3 years ago
AlexZaikin94 / MoCo-v2
View on GitHub
an implementation of MoCo and MoCo-v2 improvements pre-trained on Imagenette
☆24Jun 15, 2021Updated 5 years ago
nadsoft-opensource / RAG-with-open-source-multi-modal
View on GitHub
☆20Jan 7, 2024Updated 2 years ago
Yaziwel / Awesome-Medical-Image-Restoration
View on GitHub
This is a summary of research on general-purpose Medical Image Restoration. Please raise an issue if you suggest new qualified project.
☆15Dec 16, 2025Updated 7 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
imedslab / AdaTriplet
View on GitHub
AdaTriplet loss & automargin method
☆20Mar 8, 2022Updated 4 years ago
Xiaomeng-Yang / STR_benchmark_cleansed
View on GitHub
☆14May 26, 2023Updated 3 years ago
youngtboy / Awesome-Self-Supervised-Vision-Pretrain
View on GitHub
A paper list of self-supervised pretrain method
☆24Jun 16, 2026Updated last month
letitiabanana / PnP-OVSS
View on GitHub
[CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models
☆18Jul 22, 2024Updated last year
KevinLight831 / ESA
View on GitHub
[TCSVT2023] - ESA: External Space Attention Aggregation for Image-Text Retrieval
☆23Aug 30, 2024Updated last year
astrobdr / MoCoV2_CIFAR10
View on GitHub
Training MoCoV2 on the CIFAR10 Dataset
☆16Sep 14, 2021Updated 4 years ago
zaleni / TBot-SA1
View on GitHub
2D-3D Latent World Action Modeling for Generalizable Robot Control
☆17Jul 4, 2026Updated 2 weeks ago
AntonVanke / Kaptcha
View on GitHub
Python验证码生成工具
☆11Mar 5, 2022Updated 4 years ago
adlnlp / pdfvqa
View on GitHub
☆18Jun 12, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bytedance / AncientDoc
View on GitHub
☆15Oct 10, 2025Updated 9 months ago
THU-BPM / ICT
View on GitHub
Official repo for ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models
☆28Mar 24, 2025Updated last year
CrossmodalGroup / LAPS
View on GitHub
Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment, CVPR, 2024
☆110Jun 26, 2025Updated last year
TangXu-Group / Cross-modal-remote-sensing-image-and-text-retrieval-models
View on GitHub
☆22Sep 19, 2024Updated last year
ShaShiDiZhuanLan / Demo_Matplotlib_Python
View on GitHub
python3绘制折线图、柱形图、饼图、三维散点图、散点图
☆18Dec 12, 2019Updated 6 years ago
VL-Group / 2022-NeurIPS-DAA
View on GitHub
The code of the paper of "A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval" accepted b…
☆19Jan 16, 2024Updated 2 years ago
hzlbbfrog / CrackMamba
View on GitHub
Mamba meets crack segmentation
☆22Apr 13, 2025Updated last year
laoyangui / HSPAN
View on GitHub
☆24Dec 22, 2023Updated 2 years ago
kinredon / SCOPE
View on GitHub
An implementation of SCOPE: Saliency-Coverage Oriented Token Pruning for Efficient Multimodel LLMs (NeurIPS 2025)
☆31Oct 31, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
UCSC-VLAA / CLIPS
View on GitHub
An Enhanced CLIP Framework for Learning with Synthetic Captions
☆40Apr 18, 2025Updated last year
zsenliao / initServer
View on GitHub
一个服务器初始化及MySQL/PHP/Python3/Redis/Nodejs/Nginx/ikev2自动安装脚本，包含一个站点管理工具
☆11Feb 27, 2023Updated 3 years ago
satim-co / PolSARpro
View on GitHub
Re-implementation of selected PolSARpro functions in Python, following the scientific recommendations of PolInSAR 2021 (Work In Progress)…
☆22Jun 18, 2026Updated last month
geonhobang / RadarDistill
View on GitHub
[CVPR2024] Official Implementation of "RadarDistill: Boosting Radar-based Object Detection Performance via Knowledge Distillation from L…
☆29Jan 7, 2025Updated last year
cloudwebrtc / pion-webrtc
View on GitHub
A pure Golang implementation of the WebRTC Native API
☆12Dec 19, 2021Updated 4 years ago
prs-eth / DGInStyle-SegModel
View on GitHub
Downstream semantic segmentation evaluation of DGInStyle.
☆25Apr 1, 2024Updated 2 years ago
Yanqing0327 / MLLMs-Augmented
View on GitHub
The official implementation of 《MLLMs-Augmented Visual-Language Representation Learning》
☆31Mar 12, 2024Updated 2 years ago