JacobChalk/TIM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JacobChalk/TIM)

JacobChalk / TIM

Codebase for the paper: "TIM: A Time Interval Machine for Audio-Visual Action Recognition"

☆54

Alternatives and similar repositories for TIM

Users that are interested in TIM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

epic-kitchens / epic-kitchens-100-object-masks
View on GitHub
Support library for the MaskRCNN masks extracted on EPIC-KITCHENS-100
☆14Dec 1, 2020Updated 5 years ago
epic-kitchens / epic-sounds-annotations
View on GitHub
Splits for epic-sounds dataset
☆85Aug 2, 2025Updated 11 months ago
ekazakos / MTCN
View on GitHub
Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch
☆20Dec 16, 2021Updated 4 years ago
EGO4D / forecasting
View on GitHub
☆82Jan 5, 2024Updated 2 years ago
GeWu-Lab / LFAV
View on GitHub
Towards Long Form Audio-visual Video Understanding
☆15Jan 16, 2026Updated 6 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
epic-kitchens / epic-kitchens-100-annotations
View on GitHub
Annotations for the public release of the EPIC-KITCHENS-100 dataset
☆173Aug 1, 2022Updated 3 years ago
OpenNLPLab / FNAC_AVL
View on GitHub
[CVPR 2023] Official implementation of our paper - Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learnin…
☆29Apr 10, 2023Updated 3 years ago
ttgeng233 / UnAV
View on GitHub
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)
☆73Jan 4, 2026Updated 6 months ago
yannqi / COMBO-AVS
View on GitHub
[CVPR 2024 Highlight] Official implementation of the paper: Cooperation Does Matter: Exploring Multi-Order Bilateral Relations for Audio-…
☆40Apr 20, 2025Updated last year
Nmegha2601 / anticipatr
View on GitHub
☆12Apr 6, 2023Updated 3 years ago
yzxing87 / Seeing-and-Hearing
View on GitHub
[CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
☆155Jul 6, 2024Updated 2 years ago
vvvb-github / AVSegFormer
View on GitHub
[AAAI 2024] AVSegFormer: Audio-Visual Segmentation with Transformer
☆74Mar 6, 2025Updated last year
sming256 / ETAD
View on GitHub
[CVPRW2023] The official implementation of ETAD: A Unified Framework for Efficient Temporal Action Detection
☆19Oct 3, 2024Updated last year
Franklin905 / VALOR
View on GitHub
Research code for NeurIPS 2023 paper "Modality-Independent Teachers Meet Weakly-Supervised Audio-Visual Event Parser"
☆17Jul 13, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ekazakos / grove
View on GitHub
Code implementation for the paper "Large-scale Pre-training for Grounded Video Caption Generation" (ICCV 2025)
☆31Jan 18, 2026Updated 6 months ago
xmed-lab / CLSS
View on GitHub
The official implementation of GCLSS (Generalized CLSS) and CLSS (NeurIPS 2023: Semi-Supervised Contrastive Learning for Deep Regression …
☆15Apr 11, 2026Updated 3 months ago
nttcslab / dcase2025_task4_baseline
View on GitHub
☆18Apr 16, 2026Updated 3 months ago
epic-kitchens / epic-kitchens-100-narrator
View on GitHub
Video narrator written in Python/GTK using vlc-lib
☆25Jun 22, 2022Updated 4 years ago
Sarinda251 / CDFSL-V
View on GitHub
Accepted at ICCV '23
☆16Oct 4, 2023Updated 2 years ago
stoneMo / SLAVC
View on GitHub
Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022)
☆21Dec 6, 2022Updated 3 years ago
dkurzend / ClipClap-GZSL
View on GitHub
Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models
☆23Apr 15, 2024Updated 2 years ago
EGO4D / audio-visual
View on GitHub
☆69Sep 13, 2022Updated 3 years ago
MCG-NJU / BasicTAD
View on GitHub
BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection
☆52Jun 10, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
WikiChao / Ego-AV-Loc
View on GitHub
[CVPR 2023] Egocentric Audio-Visual Object Localization
☆27Jan 6, 2024Updated 2 years ago
sapeirone / HiERO
View on GitHub
Official implementation of "HiERO: understanding the hierarchy of human behavior enhances reasoning on egocentric videos", accepted at IC…
☆17May 22, 2026Updated last month
ninatu / howtocaption
View on GitHub
Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024
☆58Aug 19, 2025Updated 11 months ago
Twopothead / javhoo_actresses
View on GitHub
crawl profiles of Japanese PornStars from Javhoo.com
☆12Feb 8, 2020Updated 6 years ago
cxzhou35 / notebook
View on GitHub
Zicx's Notebook.
☆11Nov 7, 2025Updated 8 months ago
jasongief / CPSP
View on GitHub
[2022 TPAMI] Contrastive Positive Sample Propagation along the Audio-Visual Event Line
☆32Mar 6, 2023Updated 3 years ago
muuda / MFF-EINV2
View on GitHub
MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection
☆22Jul 17, 2024Updated 2 years ago
stoneMo / DeepAVFusion
View on GitHub
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
☆43Aug 2, 2024Updated last year
Harper812 / FFDConv
View on GitHub
Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection
☆27May 13, 2026Updated 2 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
zzhhfut / CCNet-AAAI2025
View on GitHub
This repository contains code for AAAI2025 paper "Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal …
☆24Aug 18, 2025Updated 11 months ago
shengyangsun / TDSD
View on GitHub
Official repository of "TDSD: Text-Driven Scene-Decoupled Weakly Supervised Video Anomaly Detection"
☆11May 25, 2025Updated last year
hxixixh / mix-and-localize
View on GitHub
☆23Mar 20, 2024Updated 2 years ago
camenduru / FoleyCrafter-jupyter
View on GitHub
☆10Jun 28, 2024Updated 2 years ago
brown-palm / AntGPT
View on GitHub
Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?
☆31Sep 23, 2024Updated last year
xiaogangpeng / HyperVD
View on GitHub
☆28Jul 1, 2023Updated 3 years ago
BUTSpeechFIT / mt-asr-data-prep
View on GitHub
☆25Feb 26, 2026Updated 4 months ago