McGill-NLP / AURORAView external linksLinks
Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation
☆33Jun 30, 2025Updated 7 months ago
Alternatives and similar repositories for AURORA
Users that are interested in AURORA are comparing it to the libraries listed below
Sorting:
- bulk image downloader freeware, reddit bulk image downloader, bulk image downloader extension, bulk image downloader from url, bulk image…☆25Aug 26, 2025Updated 5 months ago
- This repo contains all the codes for SEScore implementation☆15Mar 3, 2025Updated 11 months ago
- ☆19Sep 8, 2025Updated 5 months ago
- Interface for GenAI-Arena [NeurIPS24]☆17Feb 27, 2024Updated last year
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Oct 14, 2024Updated last year
- The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025)☆20Jan 18, 2026Updated 3 weeks ago
- ☆17Jun 20, 2025Updated 7 months ago
- Official implementation of our EMNLP 2022 paper "CPL: Counterfactual Prompt Learning for Vision and Language Models"☆35Dec 5, 2022Updated 3 years ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Jul 30, 2024Updated last year
- A browser for your agent.☆23Dec 7, 2025Updated 2 months ago
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encoders☆42Jun 10, 2025Updated 8 months ago
- This repository is the implementation of the paper "Flow-Based Image Abstraction" for course CS663 : Digital Image Processing.☆14May 9, 2022Updated 3 years ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (ICML 2025)☆53May 8, 2025Updated 9 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆40Nov 11, 2024Updated last year
- Entity-Driven Image Search over Multimodal Web Content (EMNLP 2023)☆26Dec 2, 2023Updated 2 years ago
- Triton Implementation of HyperAttention Algorithm☆48Dec 11, 2023Updated 2 years ago
- ☆26Jun 22, 2024Updated last year
- ☆56Aug 16, 2025Updated 6 months ago
- [AAAI 2026] Minute-Long Videos with Dual Parallelisms☆45Nov 12, 2025Updated 3 months ago
- [ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache☆55Jan 26, 2026Updated 3 weeks ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆100Feb 2, 2025Updated last year
- ☆24Oct 9, 2023Updated 2 years ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Apr 27, 2025Updated 9 months ago
- Checkpointable dataset utilities for foundation model training☆32Jan 29, 2024Updated 2 years ago
- ☆22Sep 28, 2023Updated 2 years ago
- Claude 3.5 Chrome Extension Prompt☆28Jul 30, 2024Updated last year
- ☆30Feb 6, 2026Updated last week
- Distributed Optimization Infra for learning CLIP models☆27Oct 3, 2024Updated last year
- [WACV 2026] An extremely simple method for validation-free efficient adaptation of CLIP-like VLMs that is robust to the learning rate.☆32Apr 17, 2025Updated 9 months ago
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆33Mar 15, 2024Updated last year
- [ICLR2025] IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis☆39Feb 17, 2025Updated 11 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Aug 25, 2023Updated 2 years ago
- A Massive Multi-Discipline Lecture Understanding Benchmark☆32Nov 1, 2025Updated 3 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Jun 26, 2024Updated last year
- [ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"☆33Jan 26, 2026Updated 3 weeks ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Mar 27, 2024Updated last year
- ☆33Nov 4, 2024Updated last year
- Code implementation for: From Virtual Games to Real-World Play☆46Jun 23, 2025Updated 7 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆79Dec 10, 2024Updated last year