☆63 · Oct 12, 2025 · Updated 5 months ago
Alternatives and similar repositories for defacto
Users that are interested in defacto are comparing it to the libraries listed below.
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference ☆10 · Dec 15, 2024 · Updated last year
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference ☆18 · Jun 19, 2025 · Updated 9 months ago
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model… ☆20 · Mar 13, 2025 · Updated last year
- ☆22 · Nov 27, 2025 · Updated 4 months ago
- [ICLR 2025] Data-Augmented Phrase-Level Alignment for Mitigating Object Hallucination ☆19 · Jan 27, 2025 · Updated last year
- ☆12 · Sep 27, 2017 · Updated 8 years ago
- Official code for "Federated Weakly Supervised Video Anomaly Detection with Multimodal Prompt" (AAAI2025) ☆25 · May 27, 2025 · Updated 10 months ago
- Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20… ☆15 · Aug 12, 2024 · Updated last year
- Benchmarking deep learning models for real-time object detection on various platforms ☆13 · Jan 26, 2018 · Updated 8 years ago
- AutoHallusion Codebase (EMNLP 2024) ☆22 · Dec 6, 2024 · Updated last year
- Chest X-Ray Explainer (ChEX) ☆23 · Jan 30, 2025 · Updated last year
- Official PyTorch Implementation for the "What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-mod… ☆20 · Sep 26, 2024 · Updated last year
- Open MMLab Detection Toolbox and Benchmark ☆14 · Oct 22, 2019 · Updated 6 years ago
- Code implementation for the paper "Large-scale Pre-training for Grounded Video Caption Generation" (ICCV 2025) ☆29 · Jan 18, 2026 · Updated 2 months ago
- ☆20 · Jul 22, 2024 · Updated last year
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration ☆26 · Oct 17, 2024 · Updated last year
- [ECCV 2024 (Oral)] Towards Scene Graph Anticipation ☆19 · Mar 10, 2026 · Updated 3 weeks ago
- ☆44 · Jul 28, 2025 · Updated 8 months ago
- Official implementation of paper "OED: Towards One-stage End-to-End Dynamic Scene Graph Generation". ☆27 · Mar 26, 2024 · Updated 2 years ago
- config files... ☆12 · Aug 30, 2020 · Updated 5 years ago
- Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation ☆12 · Oct 6, 2020 · Updated 5 years ago
- Note: DO NOT USE IT! THIS CODE IS PROVEN TO CONTAIN DATA LEAKAGE! Archive version of "Text Is MASS: Modeling as Stochastic Embedding for … ☆23 · May 1, 2025 · Updated 10 months ago
- Companion code to https://arxiv.org/abs/2409.03797v2 ☆19 · Sep 18, 2025 · Updated 6 months ago
- [AAAI 2025] Enhance Vision-Language Alignment with Noise ☆25 · Dec 19, 2024 · Updated last year
- The heartbeat animation indicates that the BGM is loading; please be patient and wait until the envelope appears. ☆32 · Feb 16, 2026 · Updated last month
- ☆30 · Aug 11, 2025 · Updated 7 months ago
- Repo for the EMNLP 2023 paper "A Simple Knowledge-Based Visual Question Answering" ☆25 · Dec 14, 2023 · Updated 2 years ago
- Counterfactual Reasoning VQA Dataset ☆28 · Nov 23, 2023 · Updated 2 years ago
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models ☆36 · Nov 13, 2024 · Updated last year
- Unified layout planning and image generation, ICCV2025 ☆41 · Jan 19, 2026 · Updated 2 months ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context" ☆36 · Jul 11, 2024 · Updated last year
- ☆14 · Dec 8, 2022 · Updated 3 years ago
- Annotations of key point location and vehicle orientation for VeRi-776 dataset. ICCV'17 paper: Orientation Invariant Feature Embedding an… ☆16 · Dec 5, 2017 · Updated 8 years ago
- Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti… ☆24 · Jan 26, 2025 · Updated last year
- HallE-Control: Controlling Object Hallucination in LMMs ☆32 · Apr 10, 2024 · Updated last year
- ☆34 · Mar 7, 2024 · Updated 2 years ago
- [CVPR25 Highlight] A ChatGPT-Prompted Visual hallucination Evaluation Dataset, featuring over 100,000 data samples and four advanced eval… ☆32 · Apr 16, 2025 · Updated 11 months ago
- ☆18 · Oct 23, 2022 · Updated 3 years ago
- ☆25 · Mar 4, 2026 · Updated 3 weeks ago