Simple Implementation of Pix2Seq model for object detection in PyTorch
☆130 · Sep 2, 2023 · Updated 2 years ago
Alternatives and similar repositories for Pix2Seq
Users that are interested in Pix2Seq are comparing it to the libraries listed below.
- A full-fledged version of Pix2Seq ☆237 · Nov 6, 2021 · Updated 4 years ago
- Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion) ☆942 · Nov 7, 2023 · Updated 2 years ago
- Replication of Pix2Seq with Pretrained Model ☆58 · Nov 6, 2021 · Updated 4 years ago
- Unofficial implementation of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection ☆34 · Apr 18, 2022 · Updated 4 years ago
- Simple Implementation of Pix2seqV2 (multi-task) ☆27 · Dec 16, 2024 · Updated last year
- ☆14 · Sep 9, 2024 · Updated last year
- ☆111 · Jun 30, 2023 · Updated 2 years ago
- 【CVPR 2025】SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting ☆17 · Jul 1, 2025 · Updated 10 months ago
- ☆27 · Oct 25, 2022 · Updated 3 years ago
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching". ☆14 · Sep 1, 2022 · Updated 3 years ago
- Unofficial implementation of Pix2SEQ ☆162 · Oct 5, 2021 · Updated 4 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language". ☆18 · Sep 17, 2021 · Updated 4 years ago
- Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022) ☆85 · Nov 2, 2022 · Updated 3 years ago
- Code for Point-Level Region Contrast (https://arxiv.org/abs/2202.04639) ☆35 · Dec 23, 2022 · Updated 3 years ago
- Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping". ☆13 · Dec 21, 2023 · Updated 2 years ago
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261 ☆13 · Aug 22, 2021 · Updated 4 years ago
- Open-source code for Generic Grouping Network (GGN, CVPR 2022) ☆115 · May 8, 2026 · Updated last week
- Python package to download and use the SSB datasets ☆11 · Aug 3, 2023 · Updated 2 years ago
- ☆161 · Jul 19, 2023 · Updated 2 years ago
- Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021) ☆12 · Aug 20, 2023 · Updated 2 years ago
- kitti-devkit for generating the error maps, KITTI-color-space disparity maps, and pfm2uint16png and uint16png2pfm converting ☆12 · Feb 20, 2021 · Updated 5 years ago
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language ☆1,344 · Oct 5, 2023 · Updated 2 years ago
- The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022) ☆136 · Jul 28, 2024 · Updated last year
- [NeurIPS 2024] SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow ☆46 · Dec 1, 2024 · Updated last year
- Official implementation and data release of the paper "Visual Prompting via Image Inpainting". ☆317 · Aug 7, 2023 · Updated 2 years ago
- ☆61 · Jun 17, 2022 · Updated 3 years ago
- Computational geometry algorithm templates | Computational geometry algorithm library assembled by myself, helped our team (Vegetables of Tongji) win silver medal in the IC… ☆12 · Jul 8, 2019 · Updated 6 years ago
- ☆12 · Oct 17, 2024 · Updated last year
- Implementation of Pix2Seq in PyTorch ☆10 · Feb 3, 2022 · Updated 4 years ago
- Image Captioning Using Transformer ☆270 · Jun 23, 2022 · Updated 3 years ago
- [CVPR' 22] Towards Robust Adaptive Object Detection under Noisy Annotations ☆34 · May 14, 2022 · Updated 4 years ago
- GCL implementation ☆14 · Mar 7, 2024 · Updated 2 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers ☆25 · Feb 26, 2025 · Updated last year
- FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients ☆14 · Jan 22, 2025 · Updated last year
- This repo is the code of paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24). ☆245 · Jan 10, 2025 · Updated last year
- detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks. ☆2,292 · Sep 11, 2025 · Updated 8 months ago
- Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis" ☆14 · Jun 24, 2023 · Updated 2 years ago
- This is an official implementation for "Self-Supervised Learning with Swin Transformers". ☆669 · May 13, 2021 · Updated 5 years ago
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping ☆97 · Mar 10, 2025 · Updated last year