OoDBag / VisTA
VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
☆22 · May 31, 2025 · Updated 8 months ago
Alternatives and similar repositories for VisTA
Users who are interested in VisTA are comparing it to the libraries listed below.
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text … ☆14 · Nov 20, 2025 · Updated 2 months ago
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024) ☆57 · Oct 28, 2024 · Updated last year
- Code for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025] ☆45 · Jul 22, 2025 · Updated 6 months ago
- Code for ICLR'24 workshop ME-FoMo-How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation ☆38 · Oct 18, 2024 · Updated last year
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift ☆38 · Jan 25, 2024 · Updated 2 years ago
- ☆24 · Aug 19, 2025 · Updated 5 months ago
- Automatic stabilizing and auto-piloting system for RC flying wing ☆14 · Mar 3, 2016 · Updated 9 years ago
- Official repository for the paper "On the use of Benford's law to detect GAN-generated images", ICPR2020 ☆13 · Apr 7, 2021 · Updated 4 years ago
- A large scale inpainting & t2i anime image dataset ☆14 · Oct 18, 2025 · Updated 3 months ago
- Directed masked autoencoders ☆14 · Feb 5, 2026 · Updated last week
- A simple exam generator and grader written in Python with OpenCV ☆14 · Jan 14, 2026 · Updated last month
- Building a multi-agent RAG system with advanced RAG methods ☆12 · Jan 12, 2025 · Updated last year
- ☆20 · Aug 8, 2025 · Updated 6 months ago
- AI-generated face image detection, from the 2024 6th Global Campus Artificial Intelligence Algorithm Elite Competition ☆15 · May 30, 2025 · Updated 8 months ago
- Perception Matters: Exploring Imperceptible and Transferable Anti-forensics for GAN-generated Fake Face Imagery Detection ☆11 · Jan 23, 2023 · Updated 3 years ago
- ☆10 · Apr 7, 2025 · Updated 10 months ago
- OpenSRH is the first ever publicly available stimulated Raman histology (SRH) dataset and benchmark, which will facilitate the clinical t… ☆13 · Oct 13, 2022 · Updated 3 years ago
- 💀 gigasmol: a lightweight wrapper for gigachat api model for seamless use with smolagents. ☆15 · Oct 23, 2025 · Updated 3 months ago
- Code for paper: Reinforced Vision Perception with Tools ☆69 · Oct 3, 2025 · Updated 4 months ago
- [IJCAI'25 Workshop Oral] The 1st place solution of IJCAI 2025 challenge track 1: Image Detection and Localization ☆32 · Dec 4, 2025 · Updated 2 months ago
- Surrogate Modeling of the Aerodynamic Performance for Transonic Regime ☆13 · Feb 12, 2024 · Updated 2 years ago
- The GPT-4 function calls used in everchanging quest for the HF game jam ☆10 · Jul 9, 2023 · Updated 2 years ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆15 · Updated this week
- [AAAI 2025 Oral] ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks https://arxiv.org/… ☆10 · Jun 25, 2025 · Updated 7 months ago
- ☆12 · Jul 24, 2024 · Updated last year
- Risky Object Localization (ROL) in a Driving Scene Dataset ☆15 · Dec 24, 2023 · Updated 2 years ago
- ☆11 · Apr 3, 2023 · Updated 2 years ago
- mouse pet-ct image segmentation ☆12 · Feb 19, 2023 · Updated 2 years ago
- ☆12 · Feb 2, 2024 · Updated 2 years ago
- Official code for the paper "Adversarial Magnification to Deceive Deepfake Detection through Super Resolution" ☆12 · Jun 26, 2023 · Updated 2 years ago
- Code for ACL22 short Paper "Hierarchical Curriculum Learning for AMR Parsing" ☆13 · Jun 1, 2022 · Updated 3 years ago
- ☆20 · Sep 23, 2025 · Updated 4 months ago
- Official repository of "Beyond Spatial Frequency: Pixel-wise Temporal Frequency-based Deepfake Video Detection" [ICCV 2025] ☆20 · Jan 17, 2026 · Updated 3 weeks ago
- [CVPR'25] Official code of paper "Mimic In-Context Learning for Multimodal Tasks" ☆24 · Jun 8, 2025 · Updated 8 months ago
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms ☆11 · Jan 14, 2025 · Updated last year
- ☆27 · Jan 5, 2026 · Updated last month
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022 ☆13 · Apr 13, 2022 · Updated 3 years ago
- Python client to integrate Cleanlab Codex with your AI Agent ☆19 · Nov 19, 2025 · Updated 2 months ago
- Large-scale text embedding model ☆38 · Sep 6, 2025 · Updated 5 months ago