pritamqu/CrissCross

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pritamqu/CrissCross)

pritamqu / CrissCross

[AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity

☆26

Alternatives and similar repositories for CrissCross

Users that are interested in CrissCross are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
valterlej / zsarcap
View on GitHub
Official code for Tell Me What You See: A Zero-Shot Action Recognition Method Based on Natural Language Descriptions (Multimedia Tools an…
☆13Mar 8, 2024Updated 2 years ago
xiaobai1217 / RepetitionCounting
View on GitHub
Code for "Repetitive Activity Counting by Sight and Sound"
☆24Oct 29, 2021Updated 4 years ago
alibaba-mmai-research / HiCo
View on GitHub
CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
☆18Aug 10, 2022Updated 3 years ago
swagshaw / ASC-CL
View on GitHub
Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification
☆14Jul 19, 2022Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
SMILE-data / SMILE
View on GitHub
SMILE: A Multimodal Dataset for Understanding Laughter
☆13Jun 15, 2023Updated 3 years ago
lambert-x / video-semisup
View on GitHub
Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022)
☆30Dec 1, 2022Updated 3 years ago
shuheikurita / RefEgo
View on GitHub
☆13Jul 20, 2024Updated 2 years ago
florianHofherr / PhysParamInference
View on GitHub
☆19Jan 30, 2023Updated 3 years ago
yiskw713 / ActionRecognition
View on GitHub
This repo is for action recognition using Kinetics dataset with pytorch
☆11Aug 5, 2019Updated 6 years ago
AndongDeng / BEAR
View on GitHub
BEAR: a new BEnchmark on video Action Recognition
☆46Apr 21, 2024Updated 2 years ago
IFICL / SLfM
View on GitHub
Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
☆43Jul 16, 2026Updated last week
SitongGong / Veason-R1
View on GitHub
Official code of Veason-R1
☆15Jul 14, 2026Updated 2 weeks ago
junhocho / HGCAE
View on GitHub
HGCAE Pytorch implementation. CVPR2021 accepted.
☆45Jun 29, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lscpku / VITATECS
View on GitHub
☆18Jul 10, 2024Updated 2 years ago
yzfly / TCM
View on GitHub
TCM: Temporal Correlation Module
☆17Apr 24, 2021Updated 5 years ago
danielchyeh / this-is-my
View on GitHub
Official This-Is-My Dataset published in CVPR 2023
☆16Jul 18, 2024Updated 2 years ago
WangHelin1997 / MaskSpec
View on GitHub
The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
☆51Dec 17, 2024Updated last year
yzyouzhang / Empirical-Channel-CM
View on GitHub
Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure …
☆19Feb 15, 2022Updated 4 years ago
cf020031308 / LinkDist
View on GitHub
Distillation Self-Knowledge From Contrastive Links to Classify Graph Nodes Without Passing Messages.
☆15Jun 17, 2021Updated 5 years ago
justfortherec / uva-beamer-template
View on GitHub
LaTeX beamer template in corporate design of University of Amsterdam
☆13Dec 7, 2015Updated 10 years ago
martinetoering / ViCC
View on GitHub
[WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…
☆36Aug 16, 2022Updated 3 years ago
sirilerklab / pytwitterscraper
View on GitHub
Twitter Scraper With Python
☆12Apr 12, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ethanlshen / HierNet
View on GitHub
Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…
☆23Nov 8, 2023Updated 2 years ago
pritamqu / AVCAffe
View on GitHub
[AAAI 2023] AVCAffe: A Large Scale Audio-Visual Dataset of Cognitive Load and Affect for Remote Work
☆22Dec 7, 2025Updated 7 months ago
haoheliu / diffres-python
View on GitHub
Learning differentiable temporal resolution on time-series data.
☆36Nov 12, 2022Updated 3 years ago
tandav / pitch-detectors
View on GitHub
collection of pitch (f0, fundamental frequency) detection algorithms with unified interface
☆25Nov 25, 2024Updated last year
WikiChao / Ego-AV-Loc
View on GitHub
[CVPR 2023] Egocentric Audio-Visual Object Localization
☆27Jan 6, 2024Updated 2 years ago
jmiemirza / ActMAD
View on GitHub
ActMAD: Activation Matching to Align Distributions for Test-Time-Training (CVPR 2023)
☆21Jun 27, 2023Updated 3 years ago
KHU-VLL / DEVIAS
View on GitHub
[ECCV 2024 Oral] Official implementation of the paper "DEVIAS: Learning Disentangled Video Representations of Action and Scene"
☆29Nov 15, 2025Updated 8 months ago
AndreyGuzhov / ESResNeXt-fbsp
View on GitHub
Source code for models described in the paper "ESResNe(X)t-fbsp: Learning Robust Time-Frequency Transformation of Audio" (https://arxiv.o…
☆47Jun 29, 2021Updated 5 years ago
Nanne / ProtoSim
View on GitHub
Code and instructions accompanying ICCV'23 paper Protoype-based Dataset Comparison
☆18Dec 15, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
i6092467 / vadesc
View on GitHub
A probabilistic model to cluster survival data in a variational deep clustering setting
☆32Aug 3, 2022Updated 3 years ago
Cassie07 / Using-neural-network-for-HAR
View on GitHub
Human activity recognition(LSTM, BidLSTM, BidLSTM+CNN, LSTM+CNN)
☆16Mar 6, 2018Updated 8 years ago
arijitray1993 / COLA
View on GitHub
COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!
☆25May 14, 2026Updated 2 months ago
Tenglon / hyperbolic_action
View on GitHub
Code of CVPR2020 Paper "Searching for actions on the hyperbole"
☆12Apr 20, 2021Updated 5 years ago
zhaoyanpeng / vipant
View on GitHub
VIsually-Pivoted Audio and(N) Text
☆22May 16, 2022Updated 4 years ago
sayakpaul / MLPMixer-jax2tf
View on GitHub
This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.
☆15Sep 29, 2021Updated 4 years ago
ahmedgamaleldin14 / online-action-recognition
View on GitHub
Implementation of CNN-Based Model for Online Action Recognition
☆13Aug 12, 2019Updated 6 years ago