moein-shariatnia/Pix2Seq

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/moein-shariatnia/Pix2Seq)

moein-shariatnia / Pix2Seq

Simple Implementation of Pix2Seq model for object detection in PyTorch

☆131

Alternatives and similar repositories for Pix2Seq

Users that are interested in Pix2Seq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gaopengcuhk / Stable-Pix2Seq
View on GitHub
A full-fledged version of Pix2Seq
☆237Nov 6, 2021Updated 4 years ago
google-research / pix2seq
View on GitHub
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
☆945Nov 7, 2023Updated 2 years ago
gaopengcuhk / Pretrained-Pix2Seq
View on GitHub
Replication of Pix2Seq with Pretrained Model
☆58Nov 6, 2021Updated 4 years ago
Sharpiless / Pix2seq-mmdetection
View on GitHub
Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection
☆34Apr 18, 2022Updated 4 years ago
JJJYmmm / Pix2SeqV2-Pytorch
View on GitHub
Simple Implementation of Pix2seqV2(multi-task)
☆26Dec 16, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
hanqiu-hq / MAD
View on GitHub
☆14Sep 9, 2024Updated last year
SwinTransformer / AiT
View on GitHub
☆111Jun 30, 2023Updated 3 years ago
dnjs3594 / Eigencontours
View on GitHub
☆27Oct 25, 2022Updated 3 years ago
HDETR / H-PETR-Pose
View on GitHub
[CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching".
☆14Sep 1, 2022Updated 3 years ago
gaopengcuhk / Unofficial-Pix2Seq
View on GitHub
Unofficial implementation of Pix2SEQ
☆162Oct 5, 2021Updated 4 years ago
CASIA-LMC-Lab / Obj2Seq
View on GitHub
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks (NeurIPS2022)
☆85Nov 2, 2022Updated 3 years ago
facebookresearch / PLRC
View on GitHub
Code for Point-Level Regin Contrast (https//arxiv.org/abs/2202.04639)
☆35Dec 23, 2022Updated 3 years ago
FelixHertlein / inv3d
View on GitHub
Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping".
☆13Dec 21, 2023Updated 2 years ago
DTennant / distill_visual_priors
View on GitHub
2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261
☆13Aug 22, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
facebookresearch / Generic-Grouping
View on GitHub
Open-source code for Generic Grouping Network (GGN, CVPR 2022)
☆115May 18, 2026Updated 2 months ago
DrLuo / SemiETS
View on GitHub
【CVPR 2025】SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
☆17Jul 1, 2025Updated last year
sgvaze / SSB
View on GitHub
Python package to download and use the SSB datasets
☆11Aug 3, 2023Updated 2 years ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
View on GitHub
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆18Sep 17, 2021Updated 4 years ago
DTennant / dual-rank-ncd
View on GitHub
Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021)
☆12Aug 20, 2023Updated 2 years ago
ccj5351 / kitti-devkit
View on GitHub
kitti-devkit for generating the error maps, KITTI-color-space disparity maps, and pfm2uint16png and uint16png2pfm converting
☆12Feb 20, 2021Updated 5 years ago
microsoft / X-Decoder
View on GitHub
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
☆1,346Oct 5, 2023Updated 2 years ago
amazon-science / polygon-transformer
View on GitHub
☆164Jul 19, 2023Updated 3 years ago
cvlab-stonybrook / PaperEdge
View on GitHub
The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)
☆137Jul 28, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wang-chaoyang / SemFlow
View on GitHub
[NeurIPS 2024] SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
☆46Dec 1, 2024Updated last year
amirbar / visual_prompting
View on GitHub
Official implementation and data release of the paper "Visual Prompting via Image Inpainting".
☆319Aug 7, 2023Updated 2 years ago
Alpha-VL / FastConvMAE
View on GitHub
☆61Jun 17, 2022Updated 4 years ago
wzx99 / TMIM
View on GitHub
☆13Oct 17, 2024Updated last year
xwen99 / Computational-Geometry-Algorithm-Library
View on GitHub
计算几何算法模板 | Computational geometry algorithm library assembled by myself, helped our team(Vegetables of Tongji) win silver medal in the IC…
☆12Jul 8, 2019Updated 7 years ago
saahiluppal / catr
View on GitHub
Image Captioning Using Transformer
☆270Jun 23, 2022Updated 4 years ago
shinseung428 / pix2seq-pytorch
View on GitHub
Implementation of Pix2Seq in PyTorch
☆10Feb 3, 2022Updated 4 years ago
ming71 / GCL
View on GitHub
GCL implementation
☆14Mar 7, 2024Updated 2 years ago
CityU-AIM-Group / NLTE
View on GitHub
[CVPR' 22] Towards Robust Adaptive Object Detection under Noisy Annotations
☆34May 14, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
nailwatts / FNIN
View on GitHub
FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients
☆13Jan 22, 2025Updated last year
zhilyzhang / AQSNet
View on GitHub
On the automatic quality assessment of annotated sample data for object extraction from remote sensing imagery
☆15Jul 26, 2023Updated 2 years ago
changlin31 / AutoProg
View on GitHub
(CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers
☆25Feb 26, 2025Updated last year
chenhaoxing / DiffusionInst
View on GitHub
This repo is the code of paper "DiffusionInst: Diffusion Model for Instance Segmentation" (ICASSP'24).
☆244Jan 10, 2025Updated last year
ant-research / lumos
View on GitHub
[CVPR'25 - Rating 555] Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text
☆52Mar 16, 2025Updated last year
ayumiymk / DiG
View on GitHub
Official PyTorch implementation of `Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition`
☆74Feb 27, 2023Updated 3 years ago
CVMI-Lab / SlotCon
View on GitHub
(NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping
☆98Mar 10, 2025Updated last year