amosy3 / Text2Model
☆20Updated last year
Related projects: ⓘ
- An official PyTorch implementation for CLIPPR☆28Updated last year
- This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Gener…☆14Updated 2 months ago
- Official implementation of "Dataset Size Recovery from LoRA Weights" paper.☆29Updated 2 months ago
- Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.☆33Updated 3 months ago
- Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning Weights of Generative Models" paper (ICML 2024).☆68Updated 3 months ago
- Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units" (EMNLP 2023). https://arxiv.org/abs/2212.09730☆122Updated 9 months ago
- Official Implementation for the "Conffusion: Confidence Intervals for Diffusion Models" paper.☆135Updated last year
- Official PyTorch Implementation for the "Model Tree Heritage Recovery" paper.☆54Updated 2 months ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆12Updated 8 months ago
- Official implementation of OSSGAN [CVPR 2022]☆22Updated 2 years ago
- Real-Time Deepfake Detection in the Real-World☆20Updated 3 months ago
- A Lossless Compression Library for AI pipelines☆79Updated this week
- SIEVE: Multimodal Dataset Pruning using Image-Captioning Models (CVPR 2024)☆14Updated 4 months ago
- [ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation☆16Updated 2 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆75Updated 3 months ago
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023.☆20Updated last year
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆28Updated 2 months ago
- [MM 2023] Toward High Quality Facial Representation Learning☆16Updated 10 months ago
- Evaluation script for VoxMovies dataset in PyTorch☆22Updated 8 months ago
- Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)☆15Updated last year
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆18Updated last year
- ☆25Updated last year
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆21Updated 5 months ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆25Updated 5 months ago
- DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis☆22Updated last month
- [ACM MM] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset☆65Updated 6 months ago
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆31Updated last year
- [CVPR 2021] Pytorch implementation of Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs☆48Updated 3 years ago
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆60Updated 2 months ago
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆19Updated 9 months ago