[IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official PyTorch implementation of the paper "High-Quality Visually-Guided Sound Separation from Diverse Categories"
☆28 · Nov 1, 2025 · Updated 4 months ago
Alternatives and similar repositories for DAVIS
Users who are interested in DAVIS are comparing it to the libraries listed below.
- ☆20 · May 11, 2025 · Updated 9 months ago
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis ☆35 · Feb 15, 2024 · Updated 2 years ago
- [AAAI 2025] Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding ☆34 · Mar 21, 2025 · Updated 11 months ago
- ☆24 · Jul 15, 2024 · Updated last year
- ☆24 · Nov 1, 2024 · Updated last year
- Source code for <Sequence-Level Training for Non-Autoregressive Neural Machine Translation>. ☆24 · Jan 17, 2022 · Updated 4 years ago
- Official implementation of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction ☆29 · May 26, 2024 · Updated last year
- ☆37 · Jun 20, 2025 · Updated 8 months ago
- ☆37 · May 28, 2025 · Updated 9 months ago
- Debiasing Through Data Attribution ☆12 · May 23, 2024 · Updated last year
- Combined InstantID🔥 and FouriScale to generate high-resolution images! ☆11 · Apr 3, 2024 · Updated last year
- Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition ☆10 · Oct 31, 2022 · Updated 3 years ago
- ☆10 · Jul 7, 2025 · Updated 7 months ago
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV … ☆24 · Dec 4, 2025 · Updated 2 months ago
- Official codebase for "Context Aware Deep Learning for Multi Modal Depression Detection" [ICASSP 2019, Oral] ☆11 · Dec 26, 2024 · Updated last year
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning" ☆82 · Oct 15, 2025 · Updated 4 months ago
- ☆17 · Jul 24, 2025 · Updated 7 months ago
- ☆14 · Jun 2, 2025 · Updated 9 months ago
- Reinforcing Text-Rich Video Reasoning with Visual Rumination ☆27 · Nov 24, 2025 · Updated 3 months ago
- ☆10 · Jun 13, 2022 · Updated 3 years ago
- Code for WACV24 work for multiview acoustic-visual detection ☆13 · Mar 22, 2024 · Updated last year
- Audio-Visual Perception of Omnidirectional Video for Virtual Reality Applications ☆15 · Feb 22, 2023 · Updated 3 years ago
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe… ☆15 · Jan 9, 2025 · Updated last year
- A CUDA powered audio decoding framework for FLAC. ☆11 · May 22, 2018 · Updated 7 years ago
- Probabilistic Finite Volume Method based on Affine Gaussian Process inference ☆11 · Jun 10, 2024 · Updated last year
- Analyse and Design Deep Neural Network, Dr.Kalhor, University of Tehran ☆11 · Feb 18, 2024 · Updated 2 years ago
- ☆11 · Nov 5, 2021 · Updated 4 years ago
- ☆14 · Dec 25, 2024 · Updated last year
- Source code of the paper "The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields". ☆17 · Mar 3, 2025 · Updated 11 months ago
- Colab notebooks exploring different Machine Learning topics. ☆16 · Apr 2, 2022 · Updated 3 years ago
- IBM Quantum Challenge Fall 2023 ☆10 · May 23, 2023 · Updated 2 years ago
- The public reproducible analysis code used for the gaze project ☆11 · Feb 21, 2026 · Updated last week
- Efficient SDE samplers including Gaussian-based probabilistic solvers. Written in JAX. ☆10 · Feb 8, 2025 · Updated last year
- This course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust… ☆14 · Feb 13, 2024 · Updated 2 years ago
- CAD - Memory Efficient Convolutional Adapter for Segment Anything ☆12 · Oct 4, 2024 · Updated last year
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs? ☆15 · Jun 3, 2025 · Updated 8 months ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni… ☆15 · Oct 31, 2025 · Updated 4 months ago
- Official Implementation of the Paper: Motion-example-controlled Co-speech Gesture Generation Leveraging Large Language Models (Siggraph 20… ☆24 · Dec 18, 2025 · Updated 2 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI. ☆12 · Jan 2, 2025 · Updated last year