Jiang-Yidi / FlatTrajectoryDistillation_FTD
The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)
☆18 · Updated 2 years ago
Alternatives and similar repositories for FlatTrajectoryDistillation_FTD
Users who are interested in FlatTrajectoryDistillation_FTD are comparing it to the repositories listed below
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues ☆52 · Updated 2 years ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA models, now implemented with the base model AudioLDM2. ☆33 · Updated 2 months ago
- ☆12 · Updated 3 years ago
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model ☆22 · Updated 10 months ago
- Keras implementation of Finite Scalar Quantization ☆73 · Updated last year
- [ICLR2022] Code for "Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph" ☆54 · Updated 2 years ago
- [ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and s… ☆118 · Updated last week
- Scala (NeurIPS 2024) ☆10 · Updated 6 months ago
- ☆80 · Updated last week
- ☆14 · Updated 3 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image … ☆83 · Updated 11 months ago
- LMM solved catastrophic forgetting, AAAI2025 ☆43 · Updated last month
- ☆28 · Updated last year
- ☆25 · Updated 8 months ago
- ☆46 · Updated 2 years ago
- SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022 ☆115 · Updated 2 years ago
- An official PyTorch implementation of the AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching" ☆40 · Updated last year
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023) ☆39 · Updated 2 years ago
- Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation ☆25 · Updated 3 years ago
- Distributed Optimization Infra for learning CLIP models ☆26 · Updated 8 months ago
- A curated list of awesome adversarial reprogramming and input prompting methods for neural networks since 2022 ☆36 · Updated last year
- (CVPR 2024) "Unsegment Anything by Simulating Deformation" ☆27 · Updated last year
- A torch-based implementation of K-Means and K-Means++ ☆17 · Updated 4 years ago
- Official implementation for AVGN ☆34 · Updated 2 years ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022) ☆52 · Updated last year
- A fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model. ☆17 · Updated 2 months ago
- ☆9 · Updated 9 months ago
- ☆30 · Updated last year
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee… ☆16 · Updated last year
- This repo contains a script to download the MUSIC dataset from YouTube ☆8 · Updated last year