jczhang02/MUSIC_dataset_script

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jczhang02/MUSIC_dataset_script)

jczhang02 / MUSIC_dataset_script

This repo contains script to download MUSIC dataset from youtube

☆12

Alternatives and similar repositories for MUSIC_dataset_script

Users that are interested in MUSIC_dataset_script are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

stoneMo / OneAVM
View on GitHub
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12Jun 1, 2023Updated 3 years ago
ubc-vision / TriBERT
View on GitHub
Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation" in NeurIPS…
☆14Dec 9, 2021Updated 4 years ago
roudimit / MUSIC_dataset
View on GitHub
MUSIC Dataset from The Sound of Pixels (ECCV '18)
☆137Aug 12, 2022Updated 3 years ago
hxixixh / mix-and-localize
View on GitHub
☆23Mar 20, 2024Updated 2 years ago
aispeech-lab / advr-avss
View on GitHub
Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18Jul 11, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
birdortyedi / neural-preset-pytorch
View on GitHub
☆20Apr 15, 2023Updated 3 years ago
SandyPanda-MLDL / -Evaluation-Metrics-Used-For-The-Performance-Evaluation-of-Voice-Conversion-VC-Models
View on GitHub
Evaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models
☆19Jul 8, 2025Updated last year
Twinkzzzzz / MeanSE
View on GitHub
Official implementation of 'MeanSE: Efficient Generative Speech Enhancement with Mean Flows'
☆20Oct 11, 2025Updated 9 months ago
Vekteur / probabilistic-calibration-study
View on GitHub
Implementation of "A Large-Scale Study of Probabilistic Calibration in Neural Network Regression" (ICML 2023)
☆11Oct 7, 2025Updated 9 months ago
TianyunYoung / Hallucination-Attribution
View on GitHub
This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…
☆39Jul 14, 2025Updated last year
EsYoon7 / RLHF-TLCR
View on GitHub
[ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"
☆12Dec 6, 2024Updated last year
luckyerr / Voice-Transformer_Speaker-Verification
View on GitHub
Incorporating the memory mechanism into the transformer and employing a parallel weighting structure to obtain a better utterance-level r…
☆22Oct 4, 2025Updated 9 months ago
passing2961 / DialogCC
View on GitHub
Official code and dataset for our NAACL 2024 paper: DialogCC: An Automated Pipeline for Creating High-Quality Multi-modal Dialogue Datase…
☆13Jun 24, 2024Updated 2 years ago
dmhyun / MSRP
View on GitHub
Official repository of Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization [EMNLP'22 …
☆10May 20, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
sony / CLIPSep
View on GitHub
☆43Feb 21, 2023Updated 3 years ago
brightjade / PRiSM
View on GitHub
Source code for paper "PRiSM: Enhancing Low-Resource Document-Level Relation Extraction with Relation-Aware Score Calibration", Findings …
☆11Jun 20, 2025Updated last year
mutiann / speech_rankings
View on GitHub
A CSRankings-like index for speech researchers
☆35Oct 16, 2024Updated last year
charliie-dev / dotfiles
View on GitHub
There is no place like ~
☆19Jul 7, 2024Updated 2 years ago
sarulab-speech / DuplexChat
View on GitHub
☆47Jul 5, 2026Updated 3 weeks ago
physicsofEBM / physicsofEBM.github.io
View on GitHub
The Physics of Energy Based Models
☆17Mar 20, 2024Updated 2 years ago
FutureTwT / BSTH
View on GitHub
The source code of "Bit-aware Semantic Transformer Hashing for Multi-modal Retrieval." (Accepted by SIGIR 2022)
☆18Sep 15, 2022Updated 3 years ago
facebookresearch / facestar
View on GitHub
Facestar dataset. High quality audio-visual recordings of human conversational speech.
☆112Mar 29, 2022Updated 4 years ago
dibschat / tempAgg
View on GitHub
[ECCV 2020] Temporal Aggregate Representations for Long-Range Video Understanding
☆11Sep 13, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HKUST-KnowComp / LiveSum
View on GitHub
Codes and Datasets for the Paper: Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extracti…
☆15Jun 5, 2024Updated 2 years ago
sjenni / LCI
View on GitHub
Steering Self-Supervised Feature Learning Beyond Local Pixel Statistics. In CVPR, 2020.
☆15Jul 20, 2020Updated 6 years ago
AmirooR / IntraOrderPreservingCalibration
View on GitHub
☆11Sep 11, 2022Updated 3 years ago
d-ailin / CLIP-Guided-Decoding
View on GitHub
☆18Aug 1, 2024Updated last year
kyegomez / FastFF
View on GitHub
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16Nov 11, 2024Updated last year
Tayjsl97 / RL-Chord
View on GitHub
This is the official implementation of RL-Chord (TNNLS).
☆13Jan 2, 2024Updated 2 years ago
WangHelin1997 / LibriLightMix-WHAMR
View on GitHub
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17Nov 7, 2024Updated last year
Shentao-YANG / Preference_Grounded_Guidance
View on GitHub
Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).
☆17Jan 8, 2025Updated last year
kyegomez / MobileVLM
View on GitHub
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆15Mar 11, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zjr2000 / REVERIE
View on GitHub
[ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
☆20Jul 17, 2024Updated 2 years ago
guglielmocamporese / relvit
View on GitHub
Official code of "Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision Transformer", Guglielmo Camporese, Elena…
☆21Dec 14, 2022Updated 3 years ago
zexupan / MuSE
View on GitHub
☆42Nov 22, 2024Updated last year
HS-YN / PanoAVQA
View on GitHub
Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)
☆16Oct 12, 2021Updated 4 years ago
liziniu / cold_start_rl
View on GitHub
Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?
☆20Mar 9, 2025Updated last year
knowledgetechnologyuhh / gasp
View on GitHub
☆12Jun 2, 2025Updated last year
nethermanpro / ComSL
View on GitHub
☆11Oct 14, 2023Updated 2 years ago