๐ฏ Read research papers faster with AI. Resophy is an HTML-based AI paper reader with: ๐ค AI Translation & Analysis โ instantly understand structure, contributions, and results ๐ Daily arXiv Recommendations โ discover relevant papers with less noise ๐ ๏ธ Vibe Coding Oriented โ agent-friendly and easy to customize
โ202Dec 27, 2025Updated 4 months ago
Alternatives and similar repositories for Resophy
Users that are interested in Resophy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking fโฆโ21Dec 4, 2024Updated last year
- Code and data for the paper: DTSM: Toward Dense Table Structure Recognition with Text Query Encoder and Adjacent Feature Aggregatorโ12Apr 28, 2024Updated 2 years ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucinationโ14Apr 29, 2025Updated last year
- (ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformerโ78Apr 9, 2024Updated 2 years ago
- [CVPR 2025] SAM2Object: Consolidating View Consistency via SAM2 for Zero-Shot 3D Instance Segmentationโ29Jul 17, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean โข AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [AAAI2025 Oral] Predicting the Original Appearance of Damaged Historical Documentsโ110Jul 15, 2025Updated 10 months ago
- โ20Nov 16, 2025Updated 6 months ago
- Project Page for ICLR'26: CoPRS, offering training overview, inference code, and downloadable links.โ21Mar 17, 2026Updated 2 months ago
- An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Informatโฆโ53Jan 9, 2024Updated 2 years ago
- โ11Jun 3, 2025Updated 11 months ago
- VimTS: A Unified Video and Image Text Spotterโ78Nov 10, 2024Updated last year
- WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learningโ36Jun 10, 2025Updated 11 months ago
- Official code for VINCIE: Unlocking In-context Image Editing from Videoโ56Mar 28, 2026Updated last month
- [TCSVT2025] AVLTrack: Dynamic Sparse Learning for Aerial Vision-Language Trackingโ21Mar 10, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean โข AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A bag of tricks to speed up your deep learning processโ161Apr 28, 2024Updated 2 years ago
- Chroma key (green screen removal) algorithms with Pythonโ11Jul 14, 2024Updated last year
- Awesome Visual Trackingโ24Oct 3, 2025Updated 7 months ago
- OCR Annotations from Amazon Textract for Industry Documents Libraryโ103Aug 20, 2022Updated 3 years ago
- [AAAI 2025] DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimmingโ35Jun 1, 2025Updated 11 months ago
- Neural Harmonic Textures for High-Quality Primitive Based Neural Reconstructionโ97Apr 13, 2026Updated last month
- PyTorch implementation of "Efficient Motion Prompt Learning for Robust Visual Tracking" (ICML2025)โ30Dec 17, 2025Updated 5 months ago
- โ22Jun 3, 2025Updated 11 months ago
- [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documentsโ67May 26, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean โข AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Visual Object Tracking Paper Listโ29May 14, 2026Updated last week
- DocILE: Document Information Localization and Extraction Benchmarkโ146May 15, 2024Updated 2 years ago
- Code release for "UnSAMv2: Self-Supervised Learning Enables Segment Anything at Any Granularity"โ81Feb 1, 2026Updated 3 months ago
- Quick Long Video Understanding [TMLR2025]โ78Oct 27, 2025Updated 6 months ago
- [ICME 2025] ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereoโ16Oct 12, 2025Updated 7 months ago
- โ16May 29, 2025Updated 11 months ago
- [CVPR 2026] Official implementation of "PanoVGGT: Feed-Forward 3D Reconstruction from Panoramic Imagery". A geometry-aware Transformer frโฆโ77Apr 22, 2026Updated 3 weeks ago
- Official PyTorch implementation of the CVPR 2022 paper: "Look Closer to Supervise Better: One-Shot Font Generation via Component-Based Diโฆโ93Sep 17, 2022Updated 3 years ago
- Color Point Cloud Map Evaluation Tool for Color Qualityโ36Dec 1, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The official implement of CTRNet++.โ15Dec 30, 2024Updated last year
- GTOT+RGBT234+RGBT210+LasHeR Evaluation Toolkitโ22Jun 19, 2023Updated 2 years ago
- โ16Jul 5, 2023Updated 2 years ago
- The official repo for โWildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?โโ73May 19, 2025Updated last year
- Code for ICCV 2023 Paper : โICL-D3IE: In-Context Learning with Diverse Demonstrations Updating for Document Information Extractionโโ54Aug 8, 2023Updated 2 years ago
- Localization of a GNSS denied UAV relative to a Moving Local Reference Frame in an Offshore Environmentโ14Sep 1, 2025Updated 8 months ago
- ๐ฎ glTF/GLB Viewer for VS Code. Interactive 3D model viewer with DRACO/KTX2 support, animations, textures, materials and detailed model iโฆโ25Oct 21, 2025Updated 7 months ago