Simple Implementation of Pix2Seq model for object detection in PyTorch
☆130Sep 2, 2023Updated 2 years ago
Alternatives and similar repositories for Pix2Seq
Users that are interested in Pix2Seq are comparing it to the libraries listed below.
- A full-fledged version of Pix2Seq☆237Nov 6, 2021Updated 4 years ago
- Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)☆942Nov 7, 2023Updated 2 years ago
- Replication of Pix2Seq with Pretrained Model☆58Nov 6, 2021Updated 4 years ago
- Unofficial implementation of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆34Apr 18, 2022Updated 4 years ago
- Simple Implementation of Pix2seqV2(multi-task)☆27Dec 16, 2024Updated last year
- ☆14Sep 9, 2024Updated last year
- ☆111Jun 30, 2023Updated 2 years ago
- 【CVPR 2025】SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting☆16Jul 1, 2025Updated 9 months ago
- ☆27Oct 25, 2022Updated 3 years ago
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".☆14Sep 1, 2022Updated 3 years ago
- Unofficial implementation of Pix2SEQ☆162Oct 5, 2021Updated 4 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)☆85Nov 2, 2022Updated 3 years ago
- Code for Point-Level Region Contrast (https://arxiv.org/abs/2202.04639)☆35Dec 23, 2022Updated 3 years ago
- Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping".☆13Dec 21, 2023Updated 2 years ago
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261☆13Aug 22, 2021Updated 4 years ago
- Python package to download and use the SSB datasets☆11Aug 3, 2023Updated 2 years ago
- ☆162Jul 19, 2023Updated 2 years ago
- Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021)☆12Aug 20, 2023Updated 2 years ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,343Oct 5, 2023Updated 2 years ago
- The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)☆136Jul 28, 2024Updated last year
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting".☆317Aug 7, 2023Updated 2 years ago
- ☆60Jun 17, 2022Updated 3 years ago
- Computational geometry algorithm templates | Computational geometry algorithm library that I assembled myself, which helped our team (Vegetables of Tongji) win a silver medal in the IC…☆12Jul 8, 2019Updated 6 years ago
- ☆12Oct 17, 2024Updated last year
- Implementation of Pix2Seq in PyTorch☆10Feb 3, 2022Updated 4 years ago
- [CVPR' 22] Towards Robust Adaptive Object Detection under Noisy Annotations☆34May 14, 2022Updated 3 years ago
- GCL implementation☆14Mar 7, 2024Updated 2 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers☆25Feb 26, 2025Updated last year
- FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients☆14Jan 22, 2025Updated last year
- This repo is the code of paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24).☆245Jan 10, 2025Updated last year
- detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.☆2,285Sep 11, 2025Updated 7 months ago
- Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis"☆14Jun 24, 2023Updated 2 years ago
- This is an official implementation for "Self-Supervised Learning with Swin Transformers".☆667May 13, 2021Updated 4 years ago
- Object Recognition Datasets and Challenges: A Review☆11Feb 24, 2022Updated 4 years ago
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping☆97Mar 10, 2025Updated last year
- [CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text☆53Mar 16, 2025Updated last year
- Geometric Rectification of Document Images using Adversarial Gated Unwarping Network☆25Apr 30, 2020Updated 6 years ago
- [ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)☆2,250Dec 22, 2022Updated 3 years ago