Jiang-Yidi / TransformerDistillation-SLULinks
☆12Updated 3 years ago
Alternatives and similar repositories for TransformerDistillation-SLU
Users that are interested in TransformerDistillation-SLU are comparing it to the libraries listed below
Sorting:
- Code for CVPR 2024 Oral "Neural Lineage"☆17Updated last year
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆18Updated 2 years ago
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆28Updated last year
- [CVPR 2024] SimDA: Simple Diffusion Adapter for Efficient Video Generation☆128Updated last year
- ICCV2023-Diffusion-Papers☆108Updated last year
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆89Updated last month
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆145Updated 5 months ago
- ☆56Updated last year
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆105Updated 4 months ago
- ☆132Updated last year
- Unified Multi-modal IAA Baseline and Benchmark☆82Updated 10 months ago
- Vico: Compositional Video Generation as Flow Equalization☆58Updated 8 months ago
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆33Updated 5 months ago
- Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】☆76Updated 7 months ago
- Unified layout planning and image generation, ICCV2025☆29Updated 3 months ago
- FQGAN: Factorized Visual Tokenization and Generation☆52Updated 4 months ago
- Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dyna…☆181Updated 2 years ago
- Code for ECCV 2022 paper “Learning with Recoverable Forgetting”☆21Updated 3 years ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆32Updated 4 months ago
- ☆59Updated last year
- [Arxiv 2024] Official code for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions☆30Updated 6 months ago
- [CVPR 2023] Zero-shot Generative Model Adaptation via Image-specific Prompt Learning☆83Updated 2 years ago
- PyTorch implementation of One-step Diffusion with Distribution Matching Distillation☆34Updated last year
- Denoising Diffusion Step-aware Models (ICLR2024)☆61Updated last year
- ☆135Updated last year
- (SRA) No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves☆80Updated 2 weeks ago
- This is the official implementation for ControlVAR.☆117Updated 8 months ago
- [ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization☆165Updated last year
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆69Updated 9 months ago
- Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer☆16Updated 8 months ago