Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025
☆26Jan 25, 2025Updated last year
Alternatives and similar repositories for dochaystacks
Users that are interested in dochaystacks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆20Nov 4, 2025Updated 8 months ago
- 【TNNLS 2021】DONet: Dual-Octave Network for Fast MR Image Reconstruction☆11Jun 4, 2021Updated 5 years ago
- [CVPR 2026] Task-Aware Image Signal Processor for Advanced Visual Perception☆31Mar 28, 2026Updated 3 months ago
- Creative AI for Visual Art and Music slides and demos.☆11May 2, 2023Updated 3 years ago
- [WWW 2021] Target-adaptive Graph for Cross-target Stance Detection☆16Dec 15, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Jul 1, 2024Updated 2 years ago
- [CVPR'2025] URWKV: Unified RWKV Model with Multi-state Perspective for Low-light Image Restoration☆38Aug 6, 2025Updated 10 months ago
- Codebase for RecSys 2024 paper, The Elephant in the Room: Rethinking the Usage of Pre-trained Language Model in Sequential Recommendation☆19Aug 7, 2024Updated last year
- A fast parallel implementation of RNN Transducer.☆12Apr 8, 2025Updated last year
- [CIKM 2021] Enhancing Aspect-Based Sentiment Analysis with Supervised Contrastive Learning☆21May 30, 2022Updated 4 years ago
- 【MICCAI 2024, Early Accept】Enhancing Label-efficient Medical Image Segmentation with Text-guided Diffusion Models☆32Sep 10, 2024Updated last year
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- [ICIP 2024] Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model☆16May 23, 2025Updated last year
- Artemis Speaker Tools B☆24Apr 4, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆29Aug 19, 2024Updated last year
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆54Dec 12, 2024Updated last year
- Official Implementation of ICCV 2025 paper "ForgeLens: Data-Efficient Forgery Focus for Generalizable Forgery Image Detection"☆42Nov 9, 2025Updated 7 months ago
- ☆29Oct 4, 2023Updated 2 years ago
- ☆14Jun 17, 2024Updated 2 years ago
- ☆190Mar 31, 2026Updated 3 months ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆15Dec 16, 2024Updated last year
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated last year
- ☆15Oct 6, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [TIP 2026] Toward Generalizable Forgery Detection and Reasoning.☆21Apr 20, 2026Updated 2 months ago
- ☆29Apr 7, 2026Updated 2 months ago
- Creativity Inspired Zero-Shot Learning☆32Mar 8, 2021Updated 5 years ago
- Generator loss to reduce mode-collapse and to improve the generated samples quality.☆33Jul 3, 2019Updated 7 years ago
- ☆17Mar 31, 2024Updated 2 years ago
- ChartSum is a large scale benchmark for automatic chart to text summarization☆11Jul 20, 2023Updated 2 years ago
- TTRV: Test-Time Reinforcement Learning for Vision–Language Models (CVPR 2026)☆45Mar 8, 2026Updated 3 months ago
- ☆14Oct 19, 2022Updated 3 years ago
- The official implement of paper S2-VER: Semi-Supervised Visual Emotion Recognition☆11Apr 28, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation of Kronecker Attention in Pytorch☆20Sep 12, 2020Updated 5 years ago
- ☆13Jul 16, 2024Updated last year
- ☆11Jun 28, 2026Updated last week
- A tutorial of model quantization using TensorFlow☆11Aug 2, 2021Updated 4 years ago
- Official repository for the paper "Gradient-based Jailbreak Images for Multimodal Fusion Models" (https//arxiv.org/abs/2410.03489)☆20Oct 22, 2024Updated last year
- Code for ICCV2025 paper——IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves☆18Jul 11, 2025Updated 11 months ago
- ☆41Apr 6, 2026Updated 2 months ago