☆36Jan 20, 2025Updated last year
Alternatives and similar repositories for EquiAV
Users that are interested in EquiAV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Sep 1, 2024Updated last year
- ☆17Nov 15, 2022Updated 3 years ago
- This is a codebase for I See-Through You: A Framework for Removing Foreground Occlusion in Both Sparse and Dense Light Field Images (WACV…☆18Apr 2, 2024Updated last year
- ☆28Mar 13, 2025Updated last year
- ☆28Mar 13, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆36Apr 16, 2025Updated 11 months ago
- Keras implementation of m2det object detection.☆48Aug 21, 2019Updated 6 years ago
- [CVPR2025] Official code for Lost in Translation Found in Context☆23Jan 14, 2026Updated 2 months ago
- [AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification☆12Mar 10, 2025Updated last year
- AI Development in Evolving Policy [AI DEP]☆46Jul 7, 2025Updated 8 months ago
- [NeurIPS 2025] Official implementation of "Soft Task-Aware Routing of Experts for Equivariant Representation Learning"☆33Dec 15, 2025Updated 3 months ago
- [ECCV'24] Official code for "BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation"☆42Nov 19, 2024Updated last year
- ☆18Nov 19, 2024Updated last year
- [CVPR2019]Learning Not to Learn : An adversarial method to train deep neural networks with biased data☆113May 19, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (…☆49Nov 19, 2024Updated last year
- ICML 2024, Official Implementation of "Cross-view Masked Diffusion Transformers for Person Image Synthesis."☆53Nov 5, 2024Updated last year
- ☆20Apr 18, 2024Updated last year
- ☆10Sep 25, 2024Updated last year
- [ECCV 2024] FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing☆74Aug 13, 2025Updated 7 months ago
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"☆480Sep 18, 2025Updated 6 months ago
- ☆37May 28, 2025Updated 9 months ago
- Evaluate EfficientAT models on the Holistic Evaluation of Audio Representations Benchmark.☆32Jun 23, 2023Updated 2 years ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆15Apr 29, 2025Updated 10 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official code for "SimPSI: A Simple Strategy to Preserve Spectral Information in Time Series Data Augmentation", AAAI 2024.☆36Jan 22, 2025Updated last year
- GLPDepth PyTorch Implementation: Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth☆199Mar 8, 2024Updated 2 years ago
- Official repository of StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Model (EMNLP 2024)☆42Feb 11, 2025Updated last year
- ☆69Dec 16, 2025Updated 3 months ago
- Test-time adaptation via Nearest neighbor information (TAST), submitted to ICLR'23☆24Jul 11, 2023Updated 2 years ago
- Official Implementation of EnCLAP (ICASSP 2024)☆94Jun 2, 2024Updated last year
- ☆77Nov 3, 2025Updated 4 months ago
- Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".☆288Mar 20, 2024Updated 2 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆58Oct 8, 2025Updated 5 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding☆58Sep 25, 2025Updated 6 months ago
- Recent vision transformer-based domain adaptation papers☆15Mar 17, 2022Updated 4 years ago
- ☆43Feb 21, 2023Updated 3 years ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Apr 11, 2022Updated 3 years ago
- Efficient Training of Audio Transformers with Patchout☆371Jan 12, 2024Updated 2 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆42Oct 13, 2023Updated 2 years ago
- SoTA open-source TTS☆23Jun 17, 2025Updated 9 months ago