π₯ [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"
β26Feb 9, 2025Updated last year
Alternatives and similar repositories for mirage
Users that are interested in mirage are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π₯ [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospeβ¦β56Jan 22, 2026Updated 2 months ago
- π₯ [ICLR 2025] Official Benchmark Toolkits for "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"β40Nov 21, 2025Updated 4 months ago
- Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agentsβ103Mar 10, 2026Updated 2 weeks ago
- β13Oct 23, 2024Updated last year
- β16Oct 13, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Onesβ64Jan 26, 2026Updated 2 months ago
- β13Jun 11, 2024Updated last year
- β37Feb 4, 2026Updated last month
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Modelsβ60Jan 22, 2025Updated last year
- β18Jul 31, 2025Updated 7 months ago
- β12Dec 20, 2024Updated last year
- π₯ [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"β47Jun 16, 2024Updated last year
- [AAAI 2026] SIFThinker: Spatially-Aware Image Focus for Visual Reasoningβ23Dec 2, 2025Updated 3 months ago
- [MICCAI 2025] GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Imagesβ15Mar 12, 2026Updated 2 weeks ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)β46Oct 6, 2025Updated 5 months ago
- [NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenariosβ14Nov 19, 2024Updated last year
- π Official pytorch implementation of "D2ADA: Dynamic Density-aware Active Domain Adaptation for Semantic Segmentation. Wu et al. ECCV 20β¦β25Feb 2, 2023Updated 3 years ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMsβ65Jan 1, 2026Updated 2 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How ModelβTask Alignment Induces Divergent RL Conclusions".β17Feb 9, 2026Updated last month
- VHTestβ16Oct 31, 2024Updated last year
- β16Jun 10, 2025Updated 9 months ago
- Iterate on LLM-based structured generation forward and backwardβ22Mar 20, 2025Updated last year
- Fast, free, easy, and object-agnostic video anonymizationβ11Dec 12, 2020Updated 5 years ago
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- YesBut - Multimodal Satire Comprehension Datasetβ18Oct 23, 2024Updated last year
- Measuring the Signal to Noise Ratio in Language Model Evaluationβ29Aug 19, 2025Updated 7 months ago
- β66Jun 27, 2024Updated last year
- [ICML 2025] Repository for M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Predictive Embedding Architectureβ23Mar 13, 2026Updated last week
- βοΈ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".β53Feb 23, 2026Updated last month
- Official This-Is-My Dataset published in CVPR 2023β16Jul 18, 2024Updated last year
- (ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"β47Jul 1, 2025Updated 8 months ago
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"β11Oct 11, 2024Updated last year
- Modality GapβDriven Subspace Alignment Training Paradigm For Multimodal Large Language Modelsβ55Feb 23, 2026Updated last month
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- β29Sep 2, 2025Updated 6 months ago
- Getting human perception scores from street-level imageryβ23Jul 17, 2024Updated last year
- Implementation of Baseline for Scene Text-to-Scene Text Translationβ19Mar 30, 2025Updated 11 months ago
- β19Jun 10, 2025Updated 9 months ago
- π· Python package and CLI utility to create photo mosaics - now with GPU supportβ17Mar 6, 2026Updated 2 weeks ago
- Top-K Deep Video Analytics: A Probabilistic Approachβ13Jul 21, 2022Updated 3 years ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing theirβ¦β21Jan 11, 2026Updated 2 months ago