yunyikristy/global_local

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yunyikristy/global_local)

yunyikristy / global_local

☆14

Alternatives and similar repositories for global_local

Users that are interested in global_local are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yunyikristy / CM-ACC
View on GitHub
Cross-model active contrastive coding
☆22Mar 17, 2021Updated 5 years ago
Yu-Wu / Modaily-Aware-Audio-Visual-Video-Parsing
View on GitHub
Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing
☆24Dec 29, 2021Updated 4 years ago
DAVEISHAN / TimeBalance
View on GitHub
Placeholder
☆10Jul 17, 2023Updated 3 years ago
rgb91 / temporal-deepfake-segmentation
View on GitHub
Transformer Model to detect deepfakes from popular datasets. Predictions made on embeddings (features) generated by a different ViT model…
☆14Nov 27, 2023Updated 2 years ago
sangho-vision / avbert
View on GitHub
☆31Sep 20, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
GenjiB / LAVISH
View on GitHub
Vision Transformers are Parameter-Efficient Audio-Visual Learners
☆107Aug 11, 2023Updated 2 years ago
JackSyu / Discriminative-Multi-modality-Speech-Recognition
View on GitHub
TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"
☆26Apr 27, 2022Updated 4 years ago
airsplay / vimpac
View on GitHub
☆73Jun 3, 2022Updated 4 years ago
volatileee / FacialPulse
View on GitHub
☆13Dec 2, 2024Updated last year
ws-jiang / awesome-sharpeness-aware-minimization
View on GitHub
☆11Jun 20, 2023Updated 3 years ago
skakouros / s3prl_attentive_correlation
View on GitHub
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13Nov 18, 2022Updated 3 years ago
marmot-xy / CMBS
View on GitHub
cross modal background suppression for audio-visual event localization
☆36Mar 18, 2022Updated 4 years ago
xuyingzhongguo / deepfake_supcon
View on GitHub
☆15May 18, 2024Updated 2 years ago
circle-hit / MuCDN
View on GitHub
Code for COLING 2022 accepted paper titled "MuCDN: Mutual Conversational Detachment Network for Emotion Recognition in Multi-Party Conver…
☆10Jul 21, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / MAViL
View on GitHub
The repo host the code and model of MAViL.
☆45Jul 24, 2023Updated 3 years ago
ubc-vision / TriBERT
View on GitHub
Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation" in NeurIPS…
☆14Dec 9, 2021Updated 4 years ago
ms-dot-k / Visual-Audio-Memory
View on GitHub
PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)
☆22Apr 11, 2022Updated 4 years ago
itsyoavshalev / Image-Animation-with-Perturbed-Masks
View on GitHub
Image Animation with Perturbed Masks
☆12Jun 6, 2022Updated 4 years ago
tmlr-group / G-effect
View on GitHub
[ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"
☆16Feb 27, 2025Updated last year
YuanGongND / uavm
View on GitHub
Code for the IEEE Signal Processing Letters 2022 paper "UAVM: Towards Unifying Audio and Visual Models".
☆57Apr 20, 2023Updated 3 years ago
ictnlp / LSG
View on GitHub
The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”
☆15Jan 3, 2025Updated last year
azuxmioy / fpvsum
View on GitHub
FPVSum : First-Person Video Summarization dataset
☆12Aug 31, 2018Updated 7 years ago
facebookresearch / MoCA
View on GitHub
Motion-conditional image animation for video editing
☆20Dec 2, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
i-need-sleep / mad
View on GitHub
☆16Sep 29, 2025Updated 10 months ago
qiuchili / diasenti
View on GitHub
Conversational Multimodal Emotion Recognition
☆12Dec 7, 2020Updated 5 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
LutingWang / HEAD
View on GitHub
HEtero-Assists Distillation for Heterogeneous Object Detectors
☆10Jul 3, 2023Updated 3 years ago
JongSuk1 / AVCap
View on GitHub
☆11Sep 1, 2024Updated last year
GANG370 / FedForgery
View on GitHub
☆31Jan 20, 2024Updated 2 years ago
samuelyu2002 / PACS
View on GitHub
Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)
☆18Dec 20, 2022Updated 3 years ago
Ankh1234 / DeepSyncNet
View on GitHub
This is the open source code of 《DeepSyncNet:Deep Synchronized Fusion Network for EEG-fNIRS Multimodal Brain-Computer Interfaces》
☆18Mar 17, 2026Updated 4 months ago
TencentAILabHealthcare / IIB-MIL
View on GitHub
☆11Jul 21, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
rajnish-aggarwal / Emotion-recognition-using-audio-and-video-on-RAVDES-dataset
View on GitHub
☆12May 19, 2019Updated 7 years ago
CeeZh / SILVR
View on GitHub
Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"
☆19Jan 18, 2026Updated 6 months ago
WalkerMitty / Fast-Llama2
View on GitHub
Fast instruction tuning with Llama2
☆10Apr 8, 2024Updated 2 years ago
y-chan / hifi-gan-misrnet
View on GitHub
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15Mar 21, 2023Updated 3 years ago
lm495455 / PTH-Net
View on GitHub
This is an official implementation in PyTorch of PTH-Net: Dynamic Facial Expression Recognition without Face Detection and Alignment..
☆17Jul 1, 2025Updated last year
RoyiRa / GRADE-Quantifying-sample-diversity-in-text-to-image-models
View on GitHub
☆12Mar 5, 2025Updated last year
ms-dot-k / LRW_ID
View on GitHub
The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…
☆10Oct 12, 2023Updated 2 years ago