yanghu819 / trigflowLinks
unofficial
☆10Updated 8 months ago
Alternatives and similar repositories for trigflow
Users that are interested in trigflow are comparing it to the libraries listed below
Sorting:
- Official Pytorch Implementation of "Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generati…☆9Updated 7 months ago
- [⭐️ WACV 2025 Oral ⭐️] PETALface: Parameter Efficient Transfer Learning for Low-resolution Face Recognition☆13Updated last month
- ☆9Updated 6 months ago
- Renderer for the Crello dataset☆9Updated 5 months ago
- SuperGS: Super-Resolution 3D Gaussian Splatting Enhanced by Variational Residual Features and Uncertainty-Augmented Learning☆10Updated last month
- [AAAI2025] Official implementation of the paper "RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Reso…☆17Updated 3 months ago
- Code for the paper "ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions" published at CVPR 2025☆15Updated 4 months ago
- 本项目主要是2025届浙江大学软件学院夏令营(AI营)的考核项目☆11Updated 4 months ago
- Benchmarking for Audio-Text and Audio-Visual Generation; Supports FAD, FD_VGG, FD_PANNs, FD_PaSST, IS_PaSST, IS_PANNs, KL_PaSST, KL_PANNs…☆21Updated 4 months ago
- This repo contains the official code release of the Neural Experts paper, published in NeurIPS 2024.☆10Updated 7 months ago
- [ACMMM 2024] Consistent123: One Image to Highly Consistent 3D Asset Using Case-Aware Diffusion Priors☆23Updated 8 months ago
- ☆50Updated 7 months ago
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆81Updated 7 months ago
- Towards training VQ-VAE models robustly!☆75Updated 6 months ago
- SAVEn-Vid: Synergistic Audio-Visual Integration for Enhanced Understanding in Long Video Context☆5Updated 6 months ago
- ☆17Updated 7 months ago
- (CVPR 2025 Highlight) Official repository of paper "AODRaw: Towards RAW Object Detection in Diverse Conditions" (https://arxiv.org/pdf/24…☆13Updated 3 months ago
- ☆22Updated 2 weeks ago
- [ICLR 2025] Causal Graphical Models for Vision-Language Compositional Understanding☆9Updated 3 months ago
- Code, Resources - Personal project - Llama Paper Summary - October 14, 2024.☆11Updated 9 months ago
- ☆13Updated 8 months ago
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Updated 8 months ago
- ☆11Updated 3 months ago
- ☆13Updated 4 months ago
- Stability-AI's SV3D (ECCV 2024 oral, Voleti et al.) in the diffusers convention.☆23Updated 5 months ago
- LLaVA-MR: Large Language-and-Vision Assistant for Video Moment Retrieval☆8Updated 7 months ago
- [WIP🚧] 2025 up-to-date list of resources on visual tokenizers (primarily for visual generation). Give it a star 🌟 if you find it useful…☆14Updated 6 months ago
- ☆25Updated 3 months ago
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆38Updated 3 months ago
- ☆8Updated 7 months ago