[CVPR 2025] PyTorch implementation of Diff-II
☆25Feb 27, 2025Updated last year
Alternatives and similar repositories for Diff-II
Users who are interested in Diff-II are comparing it to the repositories listed below
- [CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval…☆31Apr 16, 2025Updated 11 months ago
- [ICLR 2025] Official code for Combining Text-based and Drag-based Editing for Precise and Flexible Image Editing.☆20May 6, 2025Updated 10 months ago
- [ICCV 2023] GeoFormer for Homography Estimation☆35Dec 25, 2023Updated 2 years ago
- This repo implements ControlNet with DDPM and Latent Diffusion Model in PyTorch with Canny edges as conditional control for MNIST and Cel…☆32Nov 25, 2024Updated last year
- [ECCV' 24 Oral] CLIFF: Continual Latent Diffusion for Open-Vocabulary Object Detection☆29Sep 26, 2024Updated last year
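- Data augmentation based on back-translation; currently integrates Baidu, Youdao, and Google (requires bypassing the firewall) translation.☆21Nov 5, 2020Updated 5 years ago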
- [TGRS] Continuous urban change detection from satellite image time series☆36Jun 10, 2025Updated 9 months ago
- ☆13Dec 11, 2023Updated 2 years ago
- Official implementation and checkpoints of GeoLink remote sensing foundation model in NeurIPS2025.☆54Oct 6, 2025Updated 5 months ago
- Reparameterizing Engineering Designs for Augmented Multi-objective Optimization☆10Feb 18, 2023Updated 3 years ago
- ☆15Jan 30, 2024Updated 2 years ago
- ☆11Jun 28, 2024Updated last year
- A convolutional neural network implemented with TensorFlow and trained to recognize the faces of famous Persian celebrities.☆11Feb 24, 2024Updated 2 years ago
- A helloworld project for latent diffusion models using huggingface diffusers☆15Sep 10, 2024Updated last year
- Collection of my Reinforcement Learning (RL) practices including DQN, D3QN, and Adaptive Gamma, applied to the Lunar Lander and CartPole …☆16Oct 21, 2024Updated last year
- ☆12Aug 21, 2024Updated last year
- A generative adversarial network-based model to generate synthetic RNA sequences to target proteins☆11Sep 2, 2025Updated 6 months ago
- This project classifies news articles into categories using NLP and ML techniques. It features a user-friendly web interface.☆11Oct 1, 2024Updated last year
- [ACL 2025 🔥] Time Travel is a Comprehensive Benchmark to Evaluate LMMs on Historical and Cultural Artifacts☆19May 22, 2025Updated 9 months ago
- ☆16Mar 9, 2025Updated last year
- Code for the paper "Generative Modelling of Structurally Constrained Graphs"☆17Mar 31, 2025Updated 11 months ago
- Unconditional Image Generation using a [modifiable] pretrained VQVAE based Latent Diffusion Model, adapted from huggingface diffusers.☆16Jun 12, 2024Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆30Feb 28, 2026Updated 3 weeks ago
- ☆36Apr 14, 2021Updated 4 years ago
- Unlock the potential of latent diffusion models with MNIST! 🚀 Dive into reconstructing and generating digits using cutting-edge techniqu…☆16Jan 6, 2025Updated last year
- A Deep Learning Based approach for diagnosis of Schizophrenia using EEG brain recordings☆16Apr 18, 2024Updated last year
- TensorFlow implementation of deformable conv and pooling operations.☆10Jul 17, 2017Updated 8 years ago
- ☆13Nov 28, 2021Updated 4 years ago
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆35Nov 19, 2025Updated 4 months ago
- ☆15Dec 2, 2025Updated 3 months ago
- Multi-modal categorization of Age-related Macular Degeneration (4 classes: normal, dry AMD, pcv, wet AMD)☆31Aug 12, 2022Updated 3 years ago
- ☆20Apr 26, 2024Updated last year
- ☆11Dec 6, 2024Updated last year
- [ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.☆11Jul 16, 2024Updated last year
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆19Jul 10, 2025Updated 8 months ago
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago
- Visual Instruction Tuning for Qwen2 Base Model☆41Jun 29, 2024Updated last year
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆34Jul 3, 2025Updated 8 months ago
- [ICLR 2025] Breaking Mental Set to Improve Reasoning through Diverse Multi-Agent Debate☆19Apr 22, 2025Updated 11 months ago