☆21Nov 24, 2022Updated 3 years ago
Alternatives and similar repositories for AVA-AVD
Users that are interested in AVA-AVD are comparing it to the libraries listed below
Sorting:
- Accepted by TMM 2022☆19Aug 18, 2022Updated 3 years ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆59Jan 24, 2024Updated 2 years ago
- ☆20Dec 29, 2024Updated last year
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆25Dec 7, 2023Updated 2 years ago
- Spot the conversation: speaker diarisation in the wild☆157Jul 26, 2022Updated 3 years ago
- PyTorch implementation of "Distinguishing Homophenes using Multi-Head Visual-Audio Memory" (AAAI2022)☆27Mar 9, 2024Updated 2 years ago
- ☆32May 3, 2024Updated last year
- Clustering-based methods for overlapping diarization☆82Jan 12, 2024Updated 2 years ago
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated 2 months ago
- Retrieval Augmented Generation, but no servers involved. Backed by S3☆12Nov 3, 2023Updated 2 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Android test project displaying live camera feed in a GLSurfaceView☆10Mar 8, 2015Updated 11 years ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆46Mar 29, 2024Updated last year
- ☆43Feb 21, 2023Updated 3 years ago
- ☆42Nov 22, 2024Updated last year
- Multimodal Empathetic Chatbot☆54Jul 16, 2024Updated last year
- ☆48Jun 26, 2025Updated 8 months ago
- ME-GraphAU on Video☆11May 10, 2024Updated last year
- Respondent Video Capture Kit – PlayCorder☆10Jan 24, 2017Updated 9 years ago
- Facetracking for How We Act Together.☆10Dec 9, 2016Updated 9 years ago
- Lista de modelos y aplicaciones basadas en diffusion☆11May 4, 2023Updated 2 years ago
- A Python package for PME (Public Market Equivalent) calculation☆13Jan 16, 2026Updated last month
- Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation, ACL 2024 (main)☆13Sep 23, 2024Updated last year
- HENU Survival Handbook/Study Abroad Handbook (河大生存/飞跃手册)☆15Feb 8, 2026Updated last month
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022)☆14May 25, 2023Updated 2 years ago
- ☆11Jun 1, 2024Updated last year
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- A conversational recommender system dataset with high-quality explanations.☆11Apr 26, 2023Updated 2 years ago
- SSL Video Representation Learning project☆14Jul 8, 2025Updated 8 months ago
- 基于python实现的桌面视频动态壁纸引擎☆10Jun 2, 2022Updated 3 years ago
- (Accepted By EMNLP2022 main long)Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding☆15Oct 29, 2022Updated 3 years ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Aug 6, 2025Updated 7 months ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated last year
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Nov 22, 2022Updated 3 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Jan 17, 2021Updated 5 years ago
- ☆11Sep 1, 2024Updated last year
- ☆13Jun 7, 2025Updated 9 months ago
- [AAAI 2024] SAAS - Official PyTorch Implementation☆11Mar 28, 2024Updated last year
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 2 years ago