kkyuhun94 / dalda
[ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling
☆27Updated last week
Related projects ⓘ
Alternatives and complementary repositories for dalda
- Official Implementation of KnobGen: Controlling the Sophistication of Artwork in Sketch-Based Diffusion Models☆37Updated 2 weeks ago
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆31Updated this week
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"☆15Updated 2 weeks ago
- Open-source Python toolkit focused on deep learning with ordinal methodologies☆31Updated last week
- AAPL: Adding Attributes to Prompt Learning for Vision-Language Models (CVPRw 2024)☆30Updated 6 months ago
- we propose FlexEdit, an end-to-end image editing method that leverages both free-shape masks and language instructions for Flexible Editi…☆24Updated 2 months ago
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆30Updated 2 weeks ago
- ☆58Updated 4 months ago
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe…☆35Updated 5 months ago
- [EMNLP 2024] Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagati…☆84Updated 4 months ago
- The code of RouterDC☆27Updated last month
- [ACCV 2024 ] Official code for "DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention"☆14Updated 3 weeks ago
- ☆50Updated 2 weeks ago
- ☆49Updated 3 weeks ago
- ☆25Updated 2 months ago
- [TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…☆25Updated 6 months ago
- TensorFlow code for our ECCV'24 Workshop paper "LightAvatar: Efficient Head Avatar as Dynamic NeLF"☆22Updated this week
- The first dense retrieval model that can be prompted like an LM☆62Updated last month
- VTG-GPT: Tuning-Free Zero-Shot Video Temporal Grounding with GPT☆70Updated 2 months ago
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture☆178Updated last month
- Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models☆122Updated 2 weeks ago
- The official implementation of Cross-Task Experience Sharing (COPS)☆14Updated 2 weeks ago
- LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks☆40Updated last month
- [NeurIPS 2024] Official PyTorch implementation code for realizing the technical part of Mamba-based traversal of rationale (Meteor) to im…☆102Updated 5 months ago
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Updated 4 months ago
- Official Code for paper "Towards Efficient and Effective Unlearning of Large Language Models for Recommendation" (Frontiers of Computer S…☆34Updated 3 months ago
- ☆55Updated 3 months ago
- ☆11Updated last year
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆26Updated 2 months ago
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"☆72Updated this week