Code for "A Large-scale Dataset for Audio-Language Representation Learning"
☆14 · Sep 18, 2024 · Updated last year
Alternatives and similar repositories for Auto-ACD
Users who are interested in Auto-ACD are comparing it to the libraries listed below.
- Code implementation of RP3D-Diag ☆17 · Nov 25, 2024 · Updated last year
- The official repository for "One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts" ☆10 · Aug 16, 2024 · Updated last year
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023) ☆12 · Jun 1, 2023 · Updated 2 years ago
- ☆14 · Jul 1, 2024 · Updated last year
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback" ☆32 · Mar 4, 2025 · Updated 11 months ago
- The official codes for "AutoRG-Brain: Grounded Report Generation for Brain MRI". ☆49 · Jan 6, 2026 · Updated last month
- A simple and flexible PyTorch implementation of Video StableDiffusion (ZeroScope_v2) based on diffusers. ☆19 · Feb 15, 2024 · Updated 2 years ago
- ☆19 · May 19, 2024 · Updated last year
- [ECCV 2024 Oral] Knowledge-enhanced pretraining for computational pathology ☆47 · Oct 1, 2025 · Updated 5 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆20 · Sep 1, 2023 · Updated 2 years ago
- The official codes for "Can Modern LLMs Act as Agent Cores in Radiology Environments?" ☆28 · Jan 22, 2025 · Updated last year
- Source code for the paper 'Audio Captioning Transformer' ☆57 · Jan 18, 2022 · Updated 4 years ago
- ☆27 · Jul 18, 2025 · Updated 7 months ago
- [CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation ☆72 · Jul 25, 2023 · Updated 2 years ago
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation" ☆37 · Oct 11, 2024 · Updated last year
- WeChat official account: 机器感知 (Machine Perception) | Tracking the Latest arXiv Papers ☆38 · Jun 5, 2025 · Updated 8 months ago
- Continual Resilient (CoRe) Optimizer for PyTorch ☆11 · Jun 10, 2024 · Updated last year
- ☆50 · Apr 13, 2025 · Updated 10 months ago
- ☆11 · Nov 17, 2018 · Updated 7 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training. ☆22 · Updated this week
- My personal solutions to some textbook problems ☆10 · Feb 12, 2020 · Updated 6 years ago
- [CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation ☆45 · Sep 6, 2024 · Updated last year
- All the tools that allow me to never ever open up Final Cut ☆11 · Feb 16, 2025 · Updated last year
- ☆16 · Dec 21, 2023 · Updated 2 years ago
- benchmarks for evaluating MT models ☆11 · Jun 26, 2024 · Updated last year
- [MICCAI 2024] Embracing Massive Medical Data ☆19 · Jul 5, 2024 · Updated last year
- Dataset and code to reproduce the results of the paper "Evolving Structures in Complex Systems" ☆11 · Dec 16, 2019 · Updated 6 years ago
- Repository for the Introduction to Machine Learning and Deep Learning course as part of the International Graduate Summer School in Mathe… ☆11 · Aug 8, 2019 · Updated 6 years ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023 ☆12 · Mar 17, 2024 · Updated last year
- This Node.js script automates the process of downloading and extracting source maps from websites. It uses Puppeteer to navigate web page… ☆18 · Dec 17, 2025 · Updated 2 months ago
- Turn Trello into a CMS to power all your websites and apps. ☆10 · May 12, 2018 · Updated 7 years ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem. ☆16 · Jan 9, 2026 · Updated last month
- This is the official implementation of RL-Chord (TNNLS). ☆13 · Jan 2, 2024 · Updated 2 years ago
- Unbounded cache model for online language modeling with open vocabulary ☆11 · Feb 15, 2019 · Updated 7 years ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos' ☆13 · Jun 16, 2024 · Updated last year
- ☆10 · Aug 20, 2023 · Updated 2 years ago
- Code of our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs" which is accepted to ISSRE 2023. ☆10 · Nov 12, 2023 · Updated 2 years ago
- ☆13 · May 21, 2023 · Updated 2 years ago
- A simple agent powered by LLMs that performs tasks.