vinthony / academic
Yet Another Academic Homepage Template
☆21Updated 2 months ago
Alternatives and similar repositories for academic
Users who are interested in academic are comparing it to the repositories listed below
- FunQA benchmarks funny, creative, and magic videos for challenging tasks including timestamp localization, video description, reasoning, …☆102Updated 7 months ago
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper☆154Updated last year
- 🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)"☆178Updated last year
- Supercharged BLIP-2 that can handle videos☆118Updated last year
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties☆128Updated 8 months ago
- ☆74Updated last month
- Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)☆175Updated last year
- [ICML 2024] A Touch, Vision, and Language Dataset for Multimodal Alignment☆79Updated last month
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆34Updated last year
- [ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…☆37Updated 4 months ago
- Visual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024…☆47Updated 8 months ago
- Official implementation of "Self-Improving Video Generation"☆67Updated 2 months ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t…☆138Updated last year
- Official code repo for "Editing Implicit Assumptions in Text-to-Image Diffusion Models"☆86Updated 2 years ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆33Updated 3 months ago
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"☆63Updated last year
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆84Updated last year
- ☆127Updated last year
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆59Updated 11 months ago
- [CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA), links for downloadin…☆226Updated 9 months ago
- Up-to-date resources for conditional content generation, including human motion generation, image or video generation and editing.☆275Updated 11 months ago
- Official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆92Updated 5 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆128Updated 2 months ago
- CAPE using text-graphs☆23Updated 3 months ago
- Personalized Representation from Personalized Generation (ICLR 2025)☆64Updated 4 months ago
- ☆62Updated last month
- Official code for MotionBench (CVPR 2025)☆49Updated 4 months ago
- [CVPR2024] MotionEditor is the first diffusion-based model capable of video motion editing.☆170Updated 3 months ago
- [CVPR 2024] Official repo for "InteractDiffusion: Interaction-Control for Text-to-Image Diffusion Model".☆121Updated last month
- ☆80Updated 7 months ago