showlab / AVA-AVDView external linksLinks
☆21Nov 24, 2022Updated 3 years ago
Alternatives and similar repositories for AVA-AVD
Users that are interested in AVA-AVD are comparing it to the libraries listed below
Sorting:
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- The source code of Tim-TSENet☆15Apr 22, 2022Updated 3 years ago
- ☆20Dec 29, 2024Updated last year
- ☆49Nov 24, 2022Updated 3 years ago
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆25Dec 7, 2023Updated 2 years ago
- Spot the conversation: speaker diarisation in the wild☆157Jul 26, 2022Updated 3 years ago
- PyTorch implementation of "Distinguishing Homophenes using Multi-Head Visual-Audio Memory" (AAAI2022)☆27Mar 9, 2024Updated last year
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated last month
- LaTeX template for dissertations in Peking University☆33Apr 16, 2020Updated 5 years ago
- Retrieval Augmented Generation, but no servers involved. Backed by S3☆12Nov 3, 2023Updated 2 years ago
- Android test project displaying live camera feed in a GLSurfaceView☆10Mar 8, 2015Updated 10 years ago
- Walks through building different HTML5 layouts for AV systems☆12Oct 15, 2021Updated 4 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆46Mar 29, 2024Updated last year
- ☆42Nov 22, 2024Updated last year
- ☆46Jun 26, 2025Updated 7 months ago
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆51Aug 6, 2025Updated 6 months ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆14Aug 6, 2025Updated 6 months ago
- Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acousti…☆14Nov 12, 2020Updated 5 years ago
- ArXiv'18 implementation of amortized maximum likelihood (AML) for high-quality, weakly-supervised shape completion.☆11Nov 30, 2018Updated 7 years ago
- [AAAI 2024] SAAS - Official PyTorch Implementation☆11Mar 28, 2024Updated last year
- ☆11Sep 1, 2024Updated last year
- ☆10Jun 1, 2024Updated last year
- Run all the tests at the same time with modal.com☆11Mar 2, 2024Updated last year
- A guide to structured generation using constrained decoding☆13Jun 9, 2024Updated last year
- SSL Video Representation Learning project☆14Jul 8, 2025Updated 7 months ago
- Multi-Aspect Controllable Text Generation with Disentangled Counterfactual Augmentation, ACL 2024 (main)☆13Sep 23, 2024Updated last year
- Resources accompanying the "Zero-Shot Recommendation as Language Modeling" paper (ECIR2022)☆14May 25, 2023Updated 2 years ago
- Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"☆10Apr 20, 2025Updated 9 months ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 2 years ago
- ME-GraphAU on Video☆11May 10, 2024Updated last year
- Frontend (and soon also midleware and backend) for a new, opensource image generation platform.☆14Nov 5, 2022Updated 3 years ago
- A javascript library to generate the voronoi diagram and the medial axis transform for polygons with holes.☆11Sep 15, 2021Updated 4 years ago
- Official implementation of "FreeMotion: A Unified Framework for Number-free Text-to-Motion Synthesis"☆44Oct 15, 2024Updated last year
- Facetracking for How We Act Together.☆10Dec 9, 2016Updated 9 years ago
- Lista de modelos y aplicaciones basadas en diffusion☆11May 4, 2023Updated 2 years ago
- (Accepted By EMNLP2022 main long)Knowledge Prompting in Pre-trained Language Model for Natural Language Understanding☆14Oct 29, 2022Updated 3 years ago
- Target Agnostic Attack on Deep Models: Exploiting Security Vulnerabilities of Transfer Learning☆10Jul 2, 2019Updated 6 years ago
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated last year