PyTorch implementation of "PatchVAE: Learning Local Latent Codes for Recognition" to appear in CVPR 2020
☆14Apr 9, 2020Updated 5 years ago
Alternatives and similar repositories for PatchVAE
Users that are interested in PatchVAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Sep 21, 2022Updated 3 years ago
- A dataset collected from synchronized ad-hoc microphone arrays☆19Apr 24, 2023Updated 2 years ago
- MOMN☆16Mar 30, 2020Updated 5 years ago
- Code and datasets for 'Move2Hear: Active Audio-Visual Source Separation' (ICCV 2021)☆15Jan 17, 2023Updated 3 years ago
- ☆13Oct 12, 2020Updated 5 years ago
- ☆12Apr 16, 2024Updated last year
- PyTorch implementation of "PatchGame: Learning to Signal Mid-level Patches in Referential Games" to appear in NeurIPS 2021☆24Jun 4, 2021Updated 4 years ago
- [ICCV 2021] Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization☆25Jan 29, 2022Updated 4 years ago
- Face Generation from Textual Description using GANs.☆15Apr 2, 2024Updated last year
- Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)☆17Oct 12, 2021Updated 4 years ago
- Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge☆16Feb 10, 2023Updated 3 years ago
- CVPR 2024 Official Repository☆12Mar 27, 2024Updated last year
- Official repository for the AAAI2026 paper (Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery …☆22Feb 4, 2026Updated last month
- This repository is for code to run examples and generate the figures from the lectures notes from the module Advanced Mathematical Biolog…☆10Nov 14, 2025Updated 4 months ago
- Official Repository for the paper "No Gestures Left Behind: Learning Relationships between Spoken Language and Freeform Gestures", Findin…☆20Jun 13, 2021Updated 4 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 4 months ago
- ☆14Jan 5, 2022Updated 4 years ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆18Feb 25, 2025Updated last year
- ☆10Oct 24, 2024Updated last year
- growing interpretable part graphs on convnets via multi-shot learning, in AAAI 2017☆16May 28, 2017Updated 8 years ago
- ☆22Jan 15, 2019Updated 7 years ago
- Class Balancing GAN with a Classifier In The Loop (UAI 2021)☆12Feb 11, 2022Updated 4 years ago
- EMIT: Enhancing MLLMs for Industrial Anomaly Detection via Difficulty-Aware GRPO☆21Jan 24, 2026Updated last month
- [NeurIPS 2023]Federated Learning with Bilateral Curation for Partially Class-Disjoint Data☆14Aug 1, 2025Updated 7 months ago
- Code to simulate a reverberated, noisy version of the WSJ-2MIX dataset☆21May 30, 2020Updated 5 years ago
- ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation☆28May 27, 2025Updated 9 months ago
- [EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information☆12Oct 11, 2024Updated last year
- ☆29Jul 9, 2024Updated last year
- ☆26Jan 18, 2022Updated 4 years ago
- This repository consists of python code to train sound event localization and detection models.☆21Jan 21, 2021Updated 5 years ago
- Official Repository for the paper Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach published …☆30Jun 24, 2024Updated last year
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""☆18Jul 7, 2024Updated last year
- Code for paper Learning Audio-Visual Dereverberation☆31Aug 10, 2022Updated 3 years ago
- An Official Implementation for the Paper 'Point Beyond Class: A Benchmark for Weakly Semi-Supervised Abnormality Localization in Chest X-…☆18Oct 20, 2022Updated 3 years ago
- Code for CVPR 2021 paper Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing☆24Dec 29, 2021Updated 4 years ago
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Jan 6, 2023Updated 3 years ago
- Enhanced GPUstat-web☆10Oct 2, 2020Updated 5 years ago
- Codebase for paper ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools☆30Nov 3, 2025Updated 4 months ago
- ☆13Nov 16, 2020Updated 5 years ago