ORES: Open-vocabulary Responsible Visual Synthesis
☆14Dec 12, 2023Updated 2 years ago
Alternatives and similar repositories for ORES
Users that are interested in ORES are comparing it to the libraries listed below
Sorting:
- Repository of our paper 'Refer-it-in-RGBD' in CVPR 2021☆43May 24, 2024Updated last year
- Implementation of InstructEdit☆76Oct 30, 2023Updated 2 years ago
- "Describing Textures using Natural Language" code and data, ECCV 2020 Oral.☆17Aug 6, 2020Updated 5 years ago
- NÜWA-LIP: Language Guided Image Inpainting with Defect-free VQGAN☆40Mar 24, 2023Updated 2 years ago
- ReCo: Region-Controlled Text-to-Image Generation, CVPR 2023☆136Nov 8, 2023Updated 2 years ago
- Scalable group inference for generating high quality and diverse images with diffusion models.☆42Aug 31, 2025Updated 6 months ago
- ☆22Sep 20, 2022Updated 3 years ago
- CVPR 2022 (Oral) Pytorch Code for Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment☆22Apr 15, 2022Updated 3 years ago
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆28Jul 31, 2024Updated last year
- Visual Programming for Text-to-Image Generation and Evaluation (NeurIPS 2023)☆57Jul 25, 2023Updated 2 years ago
- Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆29Sep 23, 2024Updated last year
- ☆23Sep 28, 2023Updated 2 years ago
- A PyTorch implementation of LDAST☆26Dec 17, 2023Updated 2 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- ☆30Nov 7, 2023Updated 2 years ago
- Generating Labeled Image Datasets using Stable Diffusion Models☆27Aug 24, 2025Updated 6 months ago
- [CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》☆62May 25, 2022Updated 3 years ago
- Implementation of MDP: A Generalized Framework for Text-Guided Image Editing by Manipulating the Diffusion Path☆67Jun 23, 2023Updated 2 years ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆77Jan 24, 2024Updated 2 years ago
- Pytorch Implementation for CVPR'2022 paper ✨ "Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pr…☆28Jul 31, 2022Updated 3 years ago
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- Official implementation of "Divide & Bind Your Attention for Improved Generative Semantic Nursing" (BMVC 2023 Oral)☆37Jan 25, 2024Updated 2 years ago
- ☆33Nov 12, 2018Updated 7 years ago
- A repository for OpenHack for Lakehouse. The contents are written in Japanese.☆11Nov 20, 2023Updated 2 years ago
- An image-oriented evaluation tool for image captioning systems (EMNLP-IJCNLP 2019)☆37May 3, 2020Updated 5 years ago
- A modern audio editor with multitrack capabilities, enhanced waveform visualization, and an intuitive, sleek interface.☆17Aug 12, 2025Updated 6 months ago
- Converter from UD-trees to BART representation☆36Mar 6, 2024Updated last year
- ☆37Sep 16, 2024Updated last year
- Code for ACL 2020 paper "Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA." Hyounghun Kim, Zineng T…☆34May 14, 2020Updated 5 years ago
- A PyTorch implementation of EmpiricalMVM☆41Dec 18, 2023Updated 2 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Jul 27, 2021Updated 4 years ago
- Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022☆39Feb 17, 2023Updated 3 years ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆86Oct 26, 2025Updated 4 months ago
- rule matcher (context free grammar)☆10Dec 27, 2019Updated 6 years ago
- This is a data repository for the ACL 2020 paper: "Let Me Choose: From Verbal Context to Font Selection"☆10May 5, 2020Updated 5 years ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆43Jun 27, 2023Updated 2 years ago
- Codes for Difflare: Removing Image Flare with Latent Diffusion Models☆11Dec 24, 2024Updated last year
- A library to query heterogeneous data sources uniformly using SPARQL☆12Dec 5, 2023Updated 2 years ago
- ECG analysis to classify anterior myocardial infarction cases.☆10May 17, 2017Updated 8 years ago