OpenGVLab/Siamese-Image-Modeling

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenGVLab/Siamese-Image-Modeling)

OpenGVLab / Siamese-Image-Modeling

[CVPR 2023]Implementation of Siamese Image Modeling for Self-Supervised Vision Representation Learning

☆41

Alternatives and similar repositories for Siamese-Image-Modeling

Users that are interested in Siamese-Image-Modeling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ChangyaoTian / ADDP
View on GitHub
The official implementation of ADDP (ICLR 2024)
☆12Mar 27, 2024Updated 2 years ago
TencentARC / ConMIM
View on GitHub
Official codes for ConMIM (ICLR 2023)
☆58Feb 8, 2023Updated 3 years ago
ZhanzhouFeng / Evolved-Part-Masking
View on GitHub
The official code for the paper Evolved Part Masking for Self-Supervised Learning.
☆16Jun 14, 2023Updated 3 years ago
ZhichengHuang / CMAE
View on GitHub
The official implementation of CMAE https://arxiv.org/abs/2207.13532 and https://ieeexplore.ieee.org/document/10330745
☆121Jan 27, 2024Updated 2 years ago
bwconrad / can
View on GitHub
PyTorch reimplementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".
☆39Jan 10, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
shlokk / mae-contrastive
View on GitHub
Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".
☆37Apr 3, 2023Updated 3 years ago
OpenGVLab / De-focus-Attention-Networks
View on GitHub
Learning 1D Causal Visual Representation with De-focus Attention Networks
☆35Jun 7, 2024Updated 2 years ago
facebookresearch / Implicit-HRTF
View on GitHub
This repository contains the dataset used to train the neural network model descried in the paper "Implicit HRTF Modeling Using Tempora…
☆11Aug 4, 2023Updated 2 years ago
yliu-cs / PiTe
View on GitHub
[ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model
☆17Feb 13, 2025Updated last year
nttcslab / composing-general-audio-repr
View on GitHub
Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model
☆26Apr 26, 2023Updated 3 years ago
Yaxin9Luo / Gamma-MOD
View on GitHub
[ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models
☆45Oct 28, 2025Updated 8 months ago
OpenGVLab / Awesome-DragGAN
View on GitHub
Awesome-DragGAN: A curated list of papers, tutorials, repositories related to DragGAN
☆83Nov 8, 2023Updated 2 years ago
OpenGVLab / STM-Evaluation
View on GitHub
☆70Jun 9, 2026Updated last month
fundamentalvision / Parameterized-AP-Loss
View on GitHub
☆50Nov 10, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
OpenGVLab / InternLMM
View on GitHub
☆16Jul 6, 2023Updated 3 years ago
OpenGVLab / perception_test_iccv2023
View on GitHub
Champion Solutions repository for Perception Test challenges in ICCV2023 workshop.
☆14Oct 18, 2023Updated 2 years ago
sommerda / privacybuckets
View on GitHub
A implementation of Privacy Buckets: A numerical tool to calculate privacy loss
☆11May 19, 2022Updated 4 years ago
OpenGVLab / Official-ConvMAE-Det
View on GitHub
☆18Aug 23, 2022Updated 3 years ago
wangjiangshan0725 / COVE
View on GitHub
[NeurIPS 2024] COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing
☆26Dec 8, 2024Updated last year
OpenGVLab / M3I-Pretraining
View on GitHub
[CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.
☆91Jun 1, 2023Updated 3 years ago
Annbless / RegionCL
View on GitHub
This is the official code repo for "RegionCL: Can Simple Region Swapping Contribute to Contrastive Learning?"
☆38Dec 30, 2021Updated 4 years ago
ilyassmoummad / scl_icbhi2017
View on GitHub
PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)
☆33Feb 4, 2024Updated 2 years ago
MengLcool / SEGIC
View on GitHub
[ECCV-24] This is the official implementation of the paper "SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation".
☆27Oct 13, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
junchen14 / LoMaR
View on GitHub
LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)
☆69Apr 3, 2025Updated last year
lucidrains / CLAP
View on GitHub
Contrastive Language-Audio Pretraining
☆15May 18, 2021Updated 5 years ago
OpenGVLab / Awesome-LLM4Tool
View on GitHub
A curated list of the papers, repositories, tutorials, and anythings related to the large language models for tools
☆68Aug 22, 2023Updated 2 years ago
Maryeon / whiten_mtd
View on GitHub
Official repository of paper "Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval"
☆11Dec 20, 2023Updated 2 years ago
wenhe-jia / TIVE
View on GitHub
☆11Jan 18, 2024Updated 2 years ago
visinf / veto
View on GitHub
Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023)
☆22Mar 23, 2026Updated 3 months ago
Echo0125 / MAT-Memory-and-Anticipation-Transformer
View on GitHub
[ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding
☆50Oct 7, 2023Updated 2 years ago
wisdomikezogwo / MMAE_Pathology
View on GitHub
☆12Oct 4, 2023Updated 2 years ago
TACJu / Axial-VS
View on GitHub
This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories
☆27Mar 20, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
haoheliu / ontology-aware-audio-tagging
View on GitHub
☆14Nov 22, 2022Updated 3 years ago
MGitHubL / TMac
View on GitHub
☆14Feb 26, 2024Updated 2 years ago
ModelTC / OmniBal
View on GitHub
[ICML 2025] This is the official PyTorch implementation of "OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniv…
☆27Jun 16, 2025Updated last year
OpenGVLab / Mono-InternVL
View on GitHub
[CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
☆109Jul 18, 2025Updated last year
CVI-SZU / MG-MotionLLM
View on GitHub
[CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities
☆31Apr 6, 2025Updated last year
yuchenlichuck / CVPR2022
View on GitHub
This is a repo for CVPR 2022 Paper with Code
☆10Apr 13, 2022Updated 4 years ago
ZeyuGaoAi / Instance_based_Vision_Transformer
View on GitHub
Instance-based Vision Transformer for Subtyping of Papillary Renal Cell Carcinoma in Histopathological Image-MICCAI 2021
☆15Dec 26, 2023Updated 2 years ago