Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation
β35Jun 30, 2025Updated 10 months ago
Alternatives and similar repositories for AURORA
Users that are interested in AURORA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains all the codes for SEScore implementationβ15Mar 3, 2025Updated last year
- πPytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"β34Jul 28, 2025Updated 9 months ago
- β17Mar 3, 2025Updated last year
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"β10Dec 9, 2023Updated 2 years ago
- β24May 23, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- bulk image downloader freeware, reddit bulk image downloader, bulk image downloader extension, bulk image downloader from url, bulk imageβ¦β25Feb 19, 2026Updated 2 months ago
- Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022β31May 29, 2023Updated 2 years ago
- β17Jun 20, 2025Updated 10 months ago
- The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025)β20Jan 18, 2026Updated 3 months ago
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024β11Nov 7, 2024Updated last year
- Official Code for "Intelligent Painter: Picture Composition With Resampling Diffusion Model" (ICIP 2023)β17Jun 23, 2023Updated 2 years ago
- LEO: A powerful Hybrid Multimodal LLMβ20Jan 18, 2025Updated last year
- MAM: ModularMulti-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaborationβ46Apr 3, 2026Updated last month
- Visual Storytelling post-edit datasetβ18Sep 27, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- C/C++ -- Patchmatch/Graphcutβ14Jan 3, 2014Updated 12 years ago
- β120Jan 27, 2025Updated last year
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"β35Dec 5, 2022Updated 3 years ago
- Code to prepare data and reproduce results from CableInspect-AD paperβ15Aug 27, 2024Updated last year
- Code and dataset for "Detecting Human Artifacts from Text-to-Image Models"β50Dec 26, 2024Updated last year
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)β56May 8, 2025Updated last year
- [NeurIPS 2024] Visual Perception by Large Language Modelβs Weightsβ56Mar 31, 2025Updated last year
- This is a list of the best cheat sheets I have found for software engineering, data science and machine learning.β13Mar 5, 2020Updated 6 years ago
- (CVPR 2024) π§© TokenCompose: Text-to-Image Diffusion with Token-level Supervisionβ137Dec 21, 2024Updated last year
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Task management for AI agentsβ15Jun 25, 2025Updated 10 months ago
- Entity-Driven Image Search over Multimodal Web Content (EMNLP 2023)β26Dec 2, 2023Updated 2 years ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectoriesβ44Aug 7, 2025Updated 9 months ago
- A Massive Multi-Discipline Lecture Understanding Benchmarkβ34Apr 20, 2026Updated 2 weeks ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"β33Mar 15, 2024Updated 2 years ago
- β10Aug 22, 2022Updated 3 years ago
- Implementations of the renormalization group-based diffusion model (RGDM).β16Mar 10, 2025Updated last year
- Official PyTorch code of GroundVQA (CVPR'24)β64Sep 13, 2024Updated last year
- β24Oct 9, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β10Oct 1, 2019Updated 6 years ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesisβ62Mar 31, 2026Updated last month
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedbackβ12Jul 13, 2022Updated 3 years ago
- Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuningβ237Jan 22, 2026Updated 3 months ago
- Triton Implementation of HyperAttention Algorithmβ48Dec 11, 2023Updated 2 years ago
- Interface for GenAI-Arena [NeurIPS24]β17Feb 27, 2024Updated 2 years ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!β25Nov 23, 2024Updated last year