OpenGVLab/perception_test_iccv2023

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenGVLab/perception_test_iccv2023)

OpenGVLab / perception_test_iccv2023

Champion Solutions repository for Perception Test challenges in ICCV2023 workshop.

☆14

Alternatives and similar repositories for perception_test_iccv2023

Users that are interested in perception_test_iccv2023 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenGVLab / opengvlab.github.io
View on GitHub
☆15Feb 28, 2023Updated 3 years ago
OpenGVLab / LORIS
View on GitHub
[ICML2023] Long-Term Rhythmic Video Soundtracker
☆63Jul 28, 2025Updated 11 months ago
OpenGVLab / EgoVideo
View on GitHub
[CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Chanllenges in CVPR 2024
☆136May 11, 2025Updated last year
OpenGVLab / Awesome-LLM4Tool
View on GitHub
A curated list of the papers, repositories, tutorials, and anythings related to the large language models for tools
☆68Aug 22, 2023Updated 2 years ago
Richard-61 / FineAction
View on GitHub
The official codebase of FineAction dataset. We will update the data and code of our FineAction.
☆24Apr 10, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
OpenGVLab / VRBench
View on GitHub
[ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos
☆28Jun 4, 2026Updated last month
cg1177 / DCAN
View on GitHub
[AAAI 2022] DCAN: Improving Temporal Action Detection via Dual Context Aggregation
☆17Nov 13, 2022Updated 3 years ago
TencentARC / common_trainer
View on GitHub
Common template for pytorch project. Easy to extent and modify for new project.
☆13Dec 13, 2022Updated 3 years ago
xiaoboxia / HLC
View on GitHub
ICCV'2023: Holistic Label Correction for Noisy Multi-Label Classification
☆13Oct 29, 2023Updated 2 years ago
q935970314 / AIM22-ReverseISP-MiAlgo
View on GitHub
PyTorch implementation of AIM 2022 reverse ISP challenge by MiAlgo team.
☆12Oct 20, 2022Updated 3 years ago
VickiCui / MORE
View on GitHub
Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"
☆11Oct 11, 2024Updated last year
OpenGVLab / InternLMM
View on GitHub
☆16Jul 6, 2023Updated 3 years ago
diaoyn / 3DGSQA
View on GitHub
Perceptual Quality Assessment of 3D Gaussian Splatting: A Subjective Dataset and Prediction Metric
☆16Nov 12, 2025Updated 8 months ago
OpenGVLab / unmasked_teacher
View on GitHub
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
☆348May 27, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
MCG-NJU / BasicTAD
View on GitHub
BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection
☆52Jun 10, 2023Updated 3 years ago
OpenGVLab / DDPS
View on GitHub
Official Implementation of "Denoising Diffusion Semantic Segmentation with Mask Prior Modeling"
☆76Jul 27, 2023Updated 2 years ago
LuoXiaoHeics / COBE
View on GitHub
☆11Aug 23, 2022Updated 3 years ago
epic-kitchens / epic-sounds-annotations
View on GitHub
Splits for epic-sounds dataset
☆85Aug 2, 2025Updated 11 months ago
OpenGVLab / Official-ConvMAE-Det
View on GitHub
☆18Aug 23, 2022Updated 3 years ago
ruoyiran / DeepID
View on GitHub
The implementation of the first generation DeepID
☆11Jul 6, 2021Updated 5 years ago
OpenGVLab / UniFormerV2
View on GitHub
[ICCV2023] UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
☆351Apr 2, 2024Updated 2 years ago
fpv-iplab / stillfast
View on GitHub
Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…
☆13Apr 11, 2023Updated 3 years ago
AmirAbaskohi / LM-CPPF
View on GitHub
The official implementation of the ACL 2023 paper, "Paraphrasing-Guided Data Augmentation for Contrastive Prompt-based Few-shot Fine-tuni…
☆11Nov 28, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jakhac / CSMAE
View on GitHub
Cross-Sensor Masked Autoencoder for Content Based Image Retrieval in Remote Sensing
☆28Dec 18, 2024Updated last year
zslzx / CrossModalFlow
View on GitHub
Pytorch implementation of Promoting Single-Modal Optical Flow Network for Diverse Cross-modal Flow Estimation (AAAI 2022). The model can…
☆25Apr 11, 2023Updated 3 years ago
yujiangpu20 / cma_xdVioDet
View on GitHub
Official code for "Audio-Guided Attention Network for Weakly Supervised Violence Detection" (ICCECE2022).
☆13Mar 25, 2022Updated 4 years ago
OpenGVLab / VKnowU
View on GitHub
[ECCV 2026] VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs
☆15Feb 3, 2026Updated 5 months ago
JustinYuu / MM_Pyramid
View on GitHub
[ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing
☆15Aug 26, 2022Updated 3 years ago
CrawlScript / Awesome-LLM-Fake-News-Detection
View on GitHub
☆16Jun 3, 2025Updated last year
Franklin905 / VALOR
View on GitHub
Research code for NeurIPS 2023 paper "Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser"
☆17Jul 13, 2025Updated last year
haesleinhuepf / translate-pptx
View on GitHub
translate-pptx is a command line tool that translates PowerPoint PPTX files from one language to another.
☆20Mar 9, 2025Updated last year
sisuolv / CVPR--Sorghum--100-Cultivar-Identification--FGVC-9--3rd
View on GitHub
https://www.kaggle.com/competitions/sorghum-id-fgvc-9
☆19Mar 1, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
pq-chen / CasP
View on GitHub
[ICCV 2025] CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance
☆23Jan 4, 2026Updated 6 months ago
CPJKU / cpjku_dcase24
View on GitHub
☆29Oct 17, 2024Updated last year
huangjk97 / T2T
View on GitHub
Code for the paper: "Trash to Treasure: Harvesting OOD Data with Cross-Modal Matching for Open-Set Semi-Supervised Learning"
☆25Oct 25, 2021Updated 4 years ago
techycs18 / human-activity-recognition
View on GitHub
A Human Activity Recognition project which is capable of detecting 400 different activities
☆14Jul 22, 2020Updated 5 years ago
OpenGVLab / EmbodiedGPT
View on GitHub
☆34May 27, 2023Updated 3 years ago
WangWenhao0716 / AnyPattern
View on GitHub
[IJCV 2025] The official implementation of "AnyPattern: Towards In-context Image Copy Detection"
☆11Oct 24, 2025Updated 8 months ago
Echo0125 / MAT-Memory-and-Anticipation-Transformer
View on GitHub
[ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding
☆50Oct 7, 2023Updated 2 years ago